Inferring Pattern and Disorder in 
Close-Packed Structures from X-ray Diffraction Studies, Part I: 
e-Machine Spectral Reconstruction Theory 

D. P. Varn, 1 ' 2 G. S. Canright, 2 - 3 and J. P. Crutchfield 1 

1 Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, New Mexico 87501 
2 Department of Physics and Astronomy, University of Tennessee, Knoxville, Tennessee 37996 
3 Telenor Research and Development, 1331 Fornebu, Norway 
(Dated: February 2, 2008) 

In a recent publication [D. P. Varn, G. S. Canright, and J. P. Crutchfield, Phys. Rev. B 66:17, 156 
(2002)] we introduced a new technique for discovering and describing planar disorder in close-packed 
structures (CPSs) directly from their diffraction spectra. Here we provide the theoretical develop- 
ment behind those results, adapting computational mechanics to describe one-dimensional structure 
in materials. By way of contrast, we give a detailed analysis of the current alternative approach, 
the fault model (FM), and offer several criticisms. We then demonstrate that the computational 
mechanics description of the stacking sequence — in the form of an e- machine — provides the minimal 
and unique description of the crystal, whether ordered, disordered, or some combination. We find 
that we can detect and describe any amount of disorder, as well as materials that are mixtures of var- 
ious kinds of crystalline structure. For purposes of comparison, we show that in some special limits 
it is possible to reduce the e-machine to the FM's description of faulting structures. The comparison 
demonstrates that an e-machine gives more physical insight into material structures and also more 
accurate predictions of those structures. From the e-machine it is possible to calculate measures of 
memory, structural complexity, and configurational entropy. We demonstrate our technique on four 
prototype systems and find that it provides stacking descriptions that are superior to any so far 
used in the literature. Underlying this approach is a novel method for e-machine reconstruction that 
uses correlation functions estimated from diffraction spectra, rather than sequences of microscopic 
configurations, as is typically used in other domains. The result is that the methods developed here 
can be adapted to a wide range of experimental systems in which spectroscopic data is available. 
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I. INTRODUCTION 

Fundamental to understanding the physical properties 
of a solid is a thorough description of its composition and 
the arrangement of its constituent parts. While chemical 
analysis provides information about composition, many 
complementary methods — such as x-ray diffraction, elec- 
tron diffraction, high-resolution electron microscopy, op- 
tical microscopy, and x-ray diffraction tomography — are 
essential to discovering the large-scale structure of crys- 
talline materials. For example, the placement of Bragg 
peaks in x-ray diffraction spectra has proved to be a par- 
ticularly powerful source of structural information in or- 
dered solids for nearly a centuryi For solids that deviate 
from a strict periodic ordering of their constituent atoms, 
however, the diffraction spectrum typically shows a weak- 
ening and broadening the Bragg peaks as well as the ap- 
pearance of diffuse scattering. As the disorder becomes 
more pronounced, the Bragg peaks disappear altogether 
and leave a completely diffuse spectrum. The problem 
of inferring crystal structure for such disordered materi- 
als from x-ray diffraction has been addressed by many 
researchers In the most general case, however, 
the problem remains unsolved. Indeed, it is known that 
without the assumption of strict crystallinity, the prob- 



lem has no unique solution. 8 

Many kinds of disorder are present in solids (Siifliii such 
as Schottky defects, substitution impurities, screw and 
edge dislocations, and planar slips. Of these, only pla- 
nar slips will be considered here. Planar defects occur 
in crystal structure when one crystal plane is displaced 
from another by a non-Bravais lattice vector. These slips 
can occur during crystal growth or can result from some 
stress to the crystal, be it mechanical, thermal, or even 
through irradiation*^ When an otherwise perfect crystal 
has some planar disorder, the portion of the structure 
that cannot be thought of as part of the crystal is called 
a fault. If there is a transition between two crystal struc- 
tures, the interface between the two is also known as a 
fault, even if each layer can be thought of as belonging 
to one of the crystal structures. Many different kinds 
of faults have been postulated, including growth faults, 
deformation faults, and layer-displacement faults ^2*iii4 

Planar defects are surprisingly common in crystals, 
being especially prevalent in a broad class of materials 
know as polytypesi 12 ^ 5 ^ 16 ^ 7 ^ 18 First discovered in SiC by 
Baumhaueui^ in 1912, polytypism has since been found 
in dozens of materials. Polytypism is the phenomenon 
of solids built up from identical 2 ^ layers, called modu- 
lar layers (MLs), 21 that differ only in the manner of the 



2 



stacking. Typically, one finds that the intra-ML interac- 
tions are relatively strong as compared to the mier-ML 
interactions, so that disorder within a ML is rare. Ener- 
getic considerations usually restrict the allowed orienta- 
tions of MLs to a discrete set, with only a small energy 
difference between two different stackings. Thus, the de- 
scription of a polytype, ordered or disordered, formally 
reduces to a one-dimensional list — called the stacking se- 
quence — that lists successive orientations encountered as 
one moves along the stacking direction. 

The small energy difference between different stack- 
ings arises because the coordination of the nearest neigh- 
bor, next-nearest neighbor, and sometimes even higher 
neighbors is often the same regardless of the stacking 
arrangement. It is therefore possible to have many dis- 
tinct stackings — some periodic and some not. For sev- 
eral of the most polytypic materials — e.g., SiC, ZnS, and 
Cdi2 — there are about 150, 185, and 200 known crys- 
talline structures, respectively. Remarkably, some have 
unit cells extending over 100 MLs-. 12 - The stacking period 
of many such polytypes is far in excess of the calculated 
inter-ML interactions, which are estimated to be, for ex- 
ample, - 1 ML in ZnSS 2 . and - 3 ML in SiCS&Si Nearly 
a dozen theories have been proposedji2ii£iiL2£i2£i2L2& yet 
a satisfactory and systematic explanation is still lacking 
for the diversity and kinds of observed structure. 

Much of the interest in polytypism has centered around 
the issue of long-range order and the existence of so 
many apparently stable structures. Reconciling the cal- 
culated range of interaction between MLs with the length 
scale over which organization appears has been the chief 
mystery of polytypism. Also of interest is the charac- 
terization of the solid-state transformations common in 
many of these materials. While the length scale on which 
spatial organization appears in these materials is easily 
found for crystal structures, the similar question for dis- 
ordered structures has not so far been addressed. What 
is needed is a model is that gives a statistical description 
of the observed stacking sequences from which charac- 
teristic length parameters are calculable. Finally, from 
a unified description of both crystalline and noncrys- 
talline structures, a more comprehensive picture of poly- 
typism should emerge and hopefully render polytypism 
more amenable to theoretical discussion and analysis. 

A significant source of information about the structure 
of solids is derived from diffraction spectra. While it can 
be challenging to identify crystal structures with units 
cells over 100s of MLs, in fact most periodic structures 
have been identified^ In many polytypes, disordered se- 
quences are also common, and a single crystal can con- 
tain regions of both ordered and disordered stackings. 
A main goal of this present work is to develop a tech- 
nique for discovering and describing planar disorder in 
close-packed structures (CPSs) from diffraction spectra. 
A further task is to detail the connection between our 
model of disordered structures and physically relevant 
parameters derivable from it. We will then be in a po- 
sition to treat similar questions about the possibility of 



long-range order in disordered structures. 



A. Indirect Methods 

The problem of quantifying the effects of planar dis- 
order on diffraction spectra has a long history. (For a 
more complete discussion, see Sebastian and Krishna^) 
Perhaps the first quantitative analysis was given by Lan- 
dau^ and Lifschitz 30 assuming no correlation between 
MLs. Wilson^i provided an analysis of planar imperfec- 
tions in hexagonal Co by considering the effect of the dis- 
order on the Bragg peaks. This approach is necessarily 
limited to the case of small amounts of faulting. Hen- 
dricks and Teller 32 treated the problem rather generally, 
allowing for different form factors for the different kinds 
of MLs, variable spacing between MLs, and correlations 
between neighboring MLs. Since their method relies on 
extensive matrix calculations, it was found cumbersome 
and difficult to apply by early researchers. 

Jagodzinski developed a theory of diffraction for planar 
disorder by considering nearest-neighbor correlations for 
general layered structures^ 3 , and for next-nearest neigh- 
bor correlations of CPSsi^i By noting the direction, 
length, and intensity of non-Laue streaks in x-ray diffrac- 
tion spectra of Cu-Si alloys, Barrett was able to esti- 
mate the kind and approximate amount of stacking dis- 
order present. Patterson 36 considered the effect of defor- 
mation faults on face-centered cubic crystals (fee or 3C 37 ) 
and demonstrated how one can calculate the fraction of 
faulted layers from the widths and displacements of the 
Bragg peaks. GeversS demonstrated how to calculate 
the effects of growth faults with an n-layer influence on 
the diffraction spectra of close-packed crystals. He also 
demonstrated how to treat both growth and deformation 
faults randomly inserted into hexagonal (hep or 2H) and 
cubic crystals, as well as deformation faults randomly dis- 
tributed into 4H and 6H crystals. 39 Johnson^ examined 
the effects on the diffraction pattern of random extrin- 
sic faulting (insertion of a ML) in the 3C structure and 
compared this to the effects of intrinsic faulting (deletion 
of a ML) on the Bragg peaks in the limit of small fault 
probabilities. Prasad and Lele^i considered the effects 
of faulting on the diffraction spectra of 4H crystals con- 
taining up to nine kinds of randomly distributed faults. 
Pandey and Krishna^ 2 " treated the similar case of the 6H 
close-packed structure containing a random distribution 
of 14 distinct intrinsic faults. Pandey and Krishna^ 3 - also 
derived an expression for the intensity of diffracted radi- 
ation from a 2H crystal containing any amount of ran- 
domly placed deformation and growth faults. By mea- 
suring the broadening of the diffraction maxima of a SiC 
crystal they were able to determine the amount of each 
kind of faulting. Michalsk: 44 developed a general the- 
ory for the random distribution of single stacking faults 
for an arbitrary periodic structure and Michalski, et alm^ 
applied it to several hexagonal and rhombohedral struc- 
tures. 
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In an effort to understand experimental data concern- 
ing solid-state transformations from the 2H to the 6H 
structure in annealed SiC crystals, Pandey et a/4S4Li£ 
developed the concept of nonrandom faulting. They sug- 
gested that the presence of a fault in a structure affects 
the probability of finding another fault in close proxim- 
ity. They considered two possible faulting mechanisms — 
deformation faulting and layer-displacement faults — and 
were able to understand the stacking in SiC within this 
model. Similar work was done on the 2H-to-3C transfer- 
mation in ZnS crystalsi 12 i 46 i 47 i 48 i 49 i 50 i 51 i 52 i 53 i 54 i 55 i 56 i 57 

All of the above methods center on finding analytical 
expressions for the diffracted intensity as a function of 
fault probabilities. In order to make a quantitative esti- 
mate of the faulting, typically the placement, broadening, 
shape, and symmetry of the Bragg peaks are compared 
with that expected for a crystal containing a particu- 
lar kind of fault. Then the full width at half- maximum 
(FWHM) of one or several of the Bragg peaks are used 
to determine the fraction of faulting. 

Thus, all of these approaches are limited to small 
amounts of disorder that preserve the integrity of the 
Bragg peaks. (An exception to this is Jagodzinski's^i 
disorder model. This approach has several features in 
common with our own and can in fact be thought of 
as a constrained case of our approach. We do not dis- 
cuss his work further here, but treat it and its relation 
to ours elsewhere. 58 ) Should the disorder become suffi- 
ciently large, the Bragg peaks become too broad and do 
not stand out sufficiently from the diffuse, background 
scattering. We call these kinds of approaches indirect, 
because one begins with a set of postulated faults and 
then sorts though them, searching for one or several that 
best fit the data. 



B. Direct Methods 

Perhaps the first efforts at a direct method for de- 
termining the structure of disordered close-packed crys- 
tals were given by Dornberger-SchifSS and Farkas- 
Jahnke^Si Dornberger-Schiff gave an algorithm for re- 
lating Patterson values (i.e., fractions of faulted layers 
from Bragg peaks) to sequence probabilities but, to our 
knowledge, did not follow up with a method for finding 
the Patterson values from spectra showing diffuse scat- 
tering. Farkas-Jahnke used Patterson-like functions to 
estimate the frequency of occurrence of layer sequences 
up to length five. He was not able to obtain a complete 
set of equations and this forced the use of inequalities 
derived from symmetry arguments that do not generally 
hold in disordered crystals. Unlike the previously consid- 
ered techniques, these methods are direct since they make 
no assumption about underlying crystal structure or the 
faults it may contain. To our knowledge, since their in- 
troduction three decades ago, neither of these methods 
have been used to discover stacking structure in real ma- 
terials. Hence we do not treat them further here. 



C. The Fault Model 

We refer to the indirect approaches — analyzing a crys- 
tal structure assuming it contains a distribution of stack- 
ing errors or faults — as the fault model (FM). To date, it 
has been the dominant way in which planar disorder in 
crystals has been viewed. However, we find a number of 
difficulties with the FM, many of which have been rec- 
ognized by previous researchers (M^SMMiiii Our objec- 
tions to the FM and the way it has been used to discover 
structural information are severalfold. 

(i) The FM assumes a parent crystal. For the fault 
model to make sense, it is necessary to assume some 
crystal structure in which to introduce faulting. This 
may be satisfactory for weakly faulted crystals, but for 
those with significant disorder or those undergoing a 
solid-state phase transition to another crystal struc- 

^- ure 46.47.48.51.52.53.54.55.56.57.65.66.67.68.69.70.71.72.73 ^jis 

picture is untenable. 

(ii) Each parent crystal must be treated separately. Since 
the FM introduces stacking "mistakes" into a parent 
crystal, each kind of parent crystal must be treated in- 
dividually. There has been significant work on only two 
CPSs; namely, the 2H and 3C. In polytypism hundreds 
of other crystalline structures are known to exist. In the 
FM, each must be analyzed separately by postulating 
appropriate crystal-specific kinds of defects. Given this 
degree of complication, it is desirable to find a theoreti- 
cal framework that unites the description of the various 
kinds of fault and parent structure into a single, coherent 
picture. 

(Hi) In practice, the FM treats only the Bragg peaks 
quantitatively, effectively ignoring the diffuse scattering. 
Most researchers use formulae that give the FWHM of 
Bragg peaks in terms of the fraction of certain postu- 
lated defects to find the amount of faulting. Some do 
acknowledge that diffuse scattering is important, but to 
our knowledge none use the diffuse scattering to quanti- 
tatively measure crystal structured 

(iv) The FM is unable to capture the variety of natu- 
rally occurring stacking sequences. By assuming a small 
set of possible ways that a parent structure can deviate 
from crystallinity, the FM necessarily assumes that there 
are stacking sequences that do not occur. It is desirable 
to have an approach that considers as many candidate 
structures as possible with as few a priori restrictions as 
possible. 

(i») The FM's description of the disorder is not unique. 
It is possible to give two different faulting schemes that 
describe the same weakly faulted material* 7 - 5 " This is read- 
ily seen by noting that the layer-displacement fault in a 
2H crystal can be viewed as two adjacent, but oppositely 
oriented deformation faults. 
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D. Computational Mechanics 

Our method of discovery and quantification of planar 
structure and disorder in crystals overcomes all of these 
difficulties for the special but important case of CPSs. 
We do not assume an underlying crystalline structure. 
Indeed, we make no assumptions at all about either the 
crystal or fault structure that may be present. Instead, 
we find the frequency of occurrence of all possible stack- 
ing sequences up to a given length and use this to con- 
struct a model that captures the statistics of the stacking 
sequence. In this sense, we directly determine the stack- 
ing structure. Our scheme for describing planar disor- 
der unites both fault and crystal structure into a single 
framework. There is no need to treat each crystal struc- 
ture or faulting scheme separately. Our method treats 
any amount and kind of planar disorder present. Finally, 
we quantitatively use all of the information contained in 
the diffraction spectra, both in Bragg peaks and in dif- 
fuse scattering, to build a unique model of the stacking 
structure. This model does not find the particular stack- 
ing sequence of the specimen that generated the diffrac- 
tion pattern — this is not possible from diffraction spectra 
alone — but rather finds the statistical regularities across 
an ensemble of stacking sequences that could have given 
rise to the observed spectra. This is the best that can be 
done, in principle. 

The history of discovering planar disorder in crystals is 
one of consistent, incremental progress over nearly sev- 
enty years. There are two factors, however, that have 
hindered progress in this area. The first is calculational. 
Much of the early work centered on finding analytical ex- 
pressions for the diffracted intensity of a given crystalline 
structure permeated with a particular fault type. With 
the advent of modern numerical and symbolic calcula- 
tional methods and the concomitant ability to estimate 
diffraction patterns from any arbitrarily layered struc- 
ture^ much of this early work has been superseded. The 
second hindrance to progress has been a lack of funda- 
mental understanding of structure and disorder in one- 
dimensional sequences generated by nonlinear dynami- 
cal systems. Recently, however, a unifying framework 
has been introduced in theory of computational mechan- 

76.77.78.79.80.81 

Computational mechanics is an approach to discover- 
ing, describing, and quantifying patterns. It provides for 
the construction of the minimal and unique model for a 
process that is optimally predictive; this model is called 
an e-machine. A process's e-machine is minimal in the 
sense of requiring the fewest model components to rep- 
resent the process's structures and disorder; it is optimal 
in the sense that no alternative representation is more 
accurate; and it is unique in the sense that any alter- 
native which is both minimal and optimally predictive is 
isomorphic to it. An e-machine's algebraic structure cap- 
tures a process's symmetries and approximate symme- 
tries. From an e-machine measures of a process's mem- 
ory, entropy production, and structural complexity can 



be found. We demonstrate in the sequel 8 ^ that knowl- 
edge of the e-machine and the energy coupling between 
MLs allows one to calculate the average stacking energy 
for a disordered polytype. Before being adapted to the 
present application to polytypes, computational mechan- 
ics had been used to analyze structural complexity in a 
wide range of nonlinear processes, such as cellular au- 
tomata, 83 the logistic mapjZ£i2i and the one-dimensional 
Ising model^Si as well to experimental physical sys- 
tems, such as the dripping faucet^ atmospheric turbu- 
lence^ and geomagnetic dataS 

Our development here is organized as follows: in ^H] 
we give a detailed account of our procedure for discover- 
ing and quantifying disorder in CPSs; in flllll we compare 
our approach to the FM; in ill VI we treat four prototype 
polytypes to demonstrate our technique; in SjV] we dis- 
cuss several characteristic lengths that can be estimated 
from our model and we address how they relate to the 
range of interactions between MLs; and in ilVIl we give 
our conclusions. 



II. e-MACHINE SPECTRAL 
RECONSTRUCTION 



Previous techniques of e-machine reconstruction have 
used a sequence of data produced by the processi 76 i 88 i 89 
Here, the experimental signal comes in the form of a 
power spectrum, and we need to develop a technique to 
infer the e-machine from this type of data. We call this 
new class of inference algorithms e-machine spectral re- 
construction — abbreviated eMSR and pronounced "emis- 
sary" . We emphasize that our goal remains unchanged — 
to find the process's underlying description. It is only the 
inference procedure that is changed. In this section we 
give a detailed account of eMSR as applied to the prob- 
lem of discovering pattern and disorder in CPSs. 

We divide eMSR into five steps. First, we extract cor- 
relation information from a diffraction spectrum. Second, 
we use this to estimate stacking-sequence probabilities 
of a given length. Third, we reconstruct an e-machine 
from this distribution. Fourth, we generate a diffrac- 
tion spectrum from the e-machine. And, finally, we com- 
pare this e-machine spectrum to the original. If there 
is insufficient agreement, we repeat the second through 
fourth steps, estimating stacking-sequence probabilities 
at a longer length, building a new e-machine, and again 
comparing with the original spectrum. In the final two 
subsections, we give relations that can be used to deter- 
mine the quality of experimental data and briefly review 
several information- and computation-theoretic quanti- 
ties of physical import that can be directly estimated 
from the reconstructed e-machine. 
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A. Correlation Factors from Diffraction Spectra 

We start with the conventional assumptions concerning 
polytypism in CPSs. Namely, we assume that 

• the MLs themselves are undetected and free of any 
distortions; 

• the spacing between MLs does not depend on the 
local stacking arrangement; 

• each ML has the same scattering power; and 

• the faults extend completely across the crystal. 

We make the additional assumption that the probability 
of finding a given stacking sequence in the crystal remains 
constant through the crystal. (In statistics parlance, we 
assume that the process is stationary.) 

Let N be the number of hexagonal, close-packed MLs, 
with each ML occupying one of three orientations, de- 
noted A, B, or Ci 1 ^ 11 ' 12 ? 21 We introduce three statisti- 
cal quantities, Q c (n), Q a (n), and Q s (n)»22 the two-layer 
correlation functions (CFs), where c, a, and s stand for 
cyclic, anti-cyclic, and same, respectively. Q c (n) is de- 
fined as the probability that any two MLs at a separation 
of n are cyclically related. By cyclic, we mean that if the 



;th 



ML is in orientation A (B, C), say, then the (i + n 



ML is in orientation B (C,A). Q a (n) and Q s (n) are de- 
fined in a similar fashion. Since these are probabilities, 
< Q a {n) < 1, where a £ {c,a,s}. Additionally, at 
each n it is clear that ^2 a Q a {n) = 1. 

With these assumptions and definitions in place, the 
total diffracted intensity along the 10 J rowSl can be writ- 
ten asLSP-S 2 . 



7(0 = V 2 



sin 2 (7rAZ) 

SIT? (ill) 



N 



\Q c (n) cos(2iml H — ) 
L 6 

+ Qa{n) cos(2nnl - -)] j 



(1) 



where I is a continuous variable that indexes the mag- 
nitude of the perpendicular component of the diffracted 
wave, k = 2nl/c, and c is the spacing between adjacent 
MLs. ijj 2 is a function of I that accounts for atomic scat- 
tering factors, the structure factor, dispersion factors, or 
any other effects for which the experimentally obtained 
diffraction spectra may need to be correctediSi2i24 

It is convenient to work with the intensity per ML, 
instead of the total intensity, so we define the corrected 
diffracted intensity per ML, 1(0, as 



i(0 



1(1) 



(2) 



We will always use 1(0 unless otherwise noted and sim- 
ply call this the diffracted intensity. Observe that the 
diffracted intensity 1(0 integrated over any unit l-interval 
is unity regardless of the particular values of the CFs. 95 



We may then use this fact to normalize experimental 
data. 

The form of Eqs. and suggests that the CFs can 
be found from the diffraction pattern by Fourier analysis. 
Let us define X(n) and Y(n) as 



X(n) = & co$(2%nl) dl 



and 



Y(n) 



sm(2Tml) dl , 



(3) 



(4) 



where the small circle in the integral sign indicates that 
the integral is to be taken over a unit interval in I. It is 
possible to show 9 ^ that in the limit N — > oo 



;(r*) = ^-^[x(n)-\/3Y(n) 



and 
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X(n) + V3Y(n) 



(5) 



(6) 



Thus, the CFs can be found by Fourier analysis of the 
diffraction spectrum. 



B. Estimating the Stacking-Sequence Distribution 

In the second part of our approach, we estimate the dis- 
tribution of stacking sequences from the two-layer CFs. 
First, though, we must consider what kind of information 
the CFs contain about stacking sequences. Therefore let 
us define the stacking process as the effective stochastic 
process induced by scanning the stacking sequence along 
the stacking direction. It is convenient to represent the 
stacking sequence in terms of the Hagg notation^ where 
one replaces the set of allowed orientations {A, B, C} of 
a ML with a binary alphabet A — {0, 1}. On moving 
from the i th to the (i + l) th ML, we label each inter-ML 
transition or spirjSi as "1" if the two MLs are cyclically 
related (A ->■ B -> C -> A) and "0" if the two MLs 
are anti-cyclically related (A — ► C — > B — > A). Thus, 
the stacking constraint that no two adjacent MLs may 
have the same orientation (A, B, or C) is built into the 
notation. There is a one-to-one mapping between the 
stacking orientation sequence and the spin sequence, up 
to an overall rotation of the crystal; and we use them 
interchangeably. 

We estimate the probability distribution P(w) of find- 
ing sequences uj averaged over the sample by consider- 
ing a series of constraints on the sequence probabilities. 
Some of these constraints are simple consequences of the 
mathematics; some come from the CFs themselves. From 
conservation of probability, we have 



P(u) = P(0ii) + P(lu) = P(uO) + P(ul) 



(7) 
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for all u G A r , where A r is the set of all sequences of 
length r in the Hagg notation. Additionally, we require 
that the sum of all probabilities of sequences of length 
r + 1 be normalized, i.e., 



£ p H = 1 - 



(8) 



Equations JJJ) and (jHJ together provide 2 r constraints 
among the 2 r+1 possible stacking sequences of length 
r + 1. 

The remaining 2 r constraints come from relating CFs 
to sequence probabilities via the relations 



Q„(n) = J2 P H 



(9) 



where A™ is the subset of length-n sequences that gen- 
erate a cyclic (a = c) or an anti-cyclic (a = a) rotation 
between MLs at separation n. A sequence generates a 
cyclic (anti-cyclic) rotation between MLs at separation 
n if 2m — n = 1 (mod 3), where m is the number of Is 
(Os) in the sequence. We take as many of the relations 
in Eq. © as necessary to form a complete set of equa- 
tions to solve for P(w). At fixed r, the set of equations 
describes the stacking sequence as an r th -order Markov 
process. For r = 1 and r — 2 the sets of equations are 
linear and admit analytical solutions. At r = 3, the first 
nonlinearities appear due to the necessity of using CFs at 
n — 5 to obtain a complete set of equations. We rewrite 
the conditional probabilities at n — 5 in terms of those 
at n = 4 via relations of the form 



P(s sis 2 s 3 s i ) = 



P(S0S1S2S 3 )P(S4|S0S1S2S 3 ) 
P(S0S1S 2 S 3 )P(S4|S1S2S 3 ) 
P(s S 1 S 2 S 3 )P(siS2S3S4 : ) 

P(sis 2 s 3 0)+P(sis 2S3 l) ' 



(10) 



where, in the second line, the approximation is invoked. 
We refer to this approximation as memory-length reduc- 
tion, as it effectively limits the memory that we consider 
in order to obtain a complete set of equations. 

We refer collectively to the set of Eqs. J7J, (jHJ, and 
@ as the spectral equations at a given r. The analytical 
solutions for the r = 1 and r = 2 spectral equations are 
given in Appendix A, along with the spectral equations 
for r = 3. For the r = 3 spectral equations, we solve 
these numerically^ to find P(uj). 



e-Machine Reconstruction from the Stacking 
Process 



Si G A and ijj is a stacking sequence in the Hagg no- 
tation. Then at each ML we define the "past" uj as all 
the previous transitions Sj seen and the "future" uj as 
those transitions Sj yet to be seen: that is, uj =ujuj . 

The effective states or causal states (CSs) of the stack- 
ing process are defined as the sets of pasts ui that lead 
to statistically equivalent futures: 

u>i ~ ojj if and only if P(uj | Wj) = P(u> | ujj) ,(11) 

for all futures uj, where P(w | uji) is the conditional 
probability of seeing uj having just seen ^,2^22^ 

As a default set of CSs, we initially assume that each 
history of length r forms a unique CS. So, for eMSR at r, 
we begin with 2 r CSs, each labeled by its unique length-r 
history. We refer to this set of CSs as candidate causal 
states, as they may not be the true CSs that describe 
the stacking process. We now estimate the state-to-state 
transition probabilities between candidate CSs as follows. 
Define the transition matrices ■ as the probability of 
making a transition from a candidate CS Si to a candi- 
date CS Sj on seeing spin s. If we label each past by the 
last r spins seen, then this implies that only transitions 
of the form sqv —> vs are allowed, where v G A r ~ 1 . All 
other transitions are taken to be zero. Then we can write 
the transition matrix as 



-(«) 



= T (s) 



(12) 



We estimate these transition probabilities from the con- 
ditional probabilities, 



SQD- 



P(s| So v) 
P(s vs) 



PM ■ (13) 

We now apply the the equivalence relation, Eq. Qllf). 
to merge histories with equivalent futures. The set of 
resulting CSs, along with the transitions between states, 
defines the process's e-machine. This is the minimal, 
unique description of the stacking process that optimally 
produces the stacking distribution P(w). At this point, 
we should refer to this as a candidate e-machine, as it 
will reproduce the CFs used to find it, but it may fail 
to reproduce CFs at larger n satisfactorily. The next 
two subsections address this issue of agreement between 
theory and experiment. 

It is worth repeating that our method of e-machine re- 
construction is novel in the sense that we do not estimate 
sequence probabilities from a long string of symbols gen- 
erated by the process, as has been done previouslyiSiSi 
Rather we use the two-layer CFs obtained from Fourier 
analysis of the diffraction spectra. In this way, eMSR is 
accomplished purely from spectral information. 



In the third part of our approach, we infer the stacking 
process's e-machine from the estimated distribution of 
stacking sequences. 

Suppose we know the probability distribution P(w) 
of stacking sequences uj = . . . s_2S-iSo s i s 2 • ■ ■, where 



D. e-Machine Correlation Factors and Spectrum 

In the fourth part, we use the reconstructed e-machine 
to generate a sample spin sequence M spins long in the 
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Hagg representation. We then change representations by 
mapping this spin sequence to a stacking-orientation se- 
quence in the (A, B, C) notation. We directly find the 
CFs by scanning the stacking-orientation sequence. The 
CFs for all of the processes we consider decay to an 
asymptotic value of 1/3 for large n. We set the CFs 
to 1/3 when they reach « 1% of this value, which oc- 
curs typically for n ~ 25 — 100. We could, of course, 
find the CFs directly from spin-sequence probabilities, 
via Eq. 0. However, if one needed to calculate CFs for, 
say, n = 50, this would require finding the sequence prob- 
abilities for sequences of length 50. There are 2 50 10 15 
spin sequences for n = 50, so the sum implied by Eq. 

is difficult to perform in practice. As an alternative, 
one changes representation and rewrites the candidate 
e-machine in terms of the absolute stacking positions, 
{A,B,C}. From this new representation the CFs are 
calculable from the transition matrices. Additionally, it 
is possible to derive analytical expressions for the CFs in 
some cases^ This has not been done here, however. 

Once the CFs have been found, the diffraction spec- 
trum is readily calculated from Eqs. and It has 
been shown that for sufficiently large M, the diffraction 
spectrum for diffuse scattering scales as so that the 
number of MLs used to calculate the diffraction spectrum 
is not important, if M is sufficiently large (say, 10 000). 
To reduce the error due to fluctuations, it is desirable use 
as long a sequence as possible to find the CFs. 

In this way, the e-machine's predicted CFs and diffrac- 
tion spectrum can be calculated. 



Notice that the denominator once again reduces to unity 
due to normalization. lZ err gives a measure of how two 
diffraction spectra taken from the same sample over the 
same interval will differ from each other. Clearly, we do 
not wish to seek an e-machine that gives better agreement 
than this. So our criteria for stopping reconstruction is 
when \!Z — lZ err \ < T, where the acceptable-error thresh- 
old r is set in advance. 



F. Figures-of-Merit for Spectral Data 

An issue we have so far neglected is the CFs' indepen- 
dence. In order to solve the spectral equations, part 3 
in eMSR f flll CJI) . we need 2 r+1 independent constraints. 
It is therefore important to identify and avoid using any 
redundancies inherent in the CFs to solve the spectral 
equations. Rather than finding this a hindrance, any 
relations that CFs obey can be exploited to assess the 
quality of experimental data over a given /-interval. We 
find that, as a result of stacking constraints and conser- 
vation of probability, there are two equalities that the 
CFs must satisfy. We develop and define these measures 
here. 

We find the first by observing that, at n = 1, due to 
stacking constraints, Q c (l) + Q a (l) = 1. Adding Eqs. © 
and © with n = 1 immediately gives X(l) = —1/2. This 
suggests that we define a figure- of-merit 7 as 



7 = <* l(Z)cos(27rZ)dZ 



(16) 



E. Comparing Original with e-Machine Spectra 

In the fifth and final part, we compare e-machine CFs 
and spectrum to the original spectrum. If there is not 
sufficient agreement, we increment r and repeat the re- 
construction and comparison. 

More precisely, in comparing the reconstructed e- 
machine "theory" with the original spectra, we need 
a quantitative measure of the goodness-of-fit between 
them. We use the profile TZ-factor^ which is defined 
as 



K = 



/|Um(0 



,., ^ x f ) \ dl x 100% 



(14) 



where Um(0 is the e-machine diffraction spectrum and 
Uxp(0 is the experimental. Notice that the denominator 
is unity due to normalization. 

It is important, however, not to over-fit the original 
data, so we should not seek a fit that is closer than ex- 
perimental error. Let us define 5\ exp (l) as the fluctuation- 
induced error in the diffracted intensity as a function of 
I. Then spectral error R err can be defined as 



S\5\ exv (l)\ dl 

n e „ = Jl „! ,1 x 100% . 



(15) 



7 can be used to evaluate the quality of experimental 
spectra. For an ideal, error-free spectrum, 7 = —1/2. 
Since many spectra are known to contain some system- 
atic error fiSiSi the amount by which 7 deviates from —1/2 
can be used to assess how corrupt the data is over a given 
unit /-interval. 

To find the second constraint, we observe that Eq. J7J), 
with r = 1 and u = 0, gives P(01) = P(10). We therefore 
find from Eq. © that P(00) + 2P(01) + P(ll) = 1. We 
can write P(01) = P(l) - P(ll). This implies that 



P(00) + 2P(1) -P(ll) = 1 



(17) 



Making the identification from Eq. JUJ) that Q c (l) = P(l), 
Q a (2) = P(ll), and Q c (2) = P(00) gives 



2Q C (1) + Q c (2) - Q a (2) = 1 



(18) 



This suggests that we define a second figure- of-merit (3 
to be 



(3 = 2Q C (1)+Q c (2)-Q a (2) 



(19) 



P should be unity for error-free data. This can also be 
used to evaluate the quality of the experimental data 
over a given unit /-interval. Together, 7 and are the 
figures-of-merit over a unit /-interval for a diffraction 
spectrum. Therefore, in the first part of eMSR f ^II A|) 



8 



1. Find the CFs from the diffraction spectrum. 

la. Correct the spectrum for any experimental factors, 
lb. Calculate the figures-of- merit ( £111 Ft over possible 

Z-intervals to find an interval suitable for e-machine 

reconstruction, 
lc. Find the CFs over this interval. 
Id. Estimate the spectral error 7Z err from the 
diffraction spectrum. 

2. Estimate stacking distribution P(w r ) from CFs. 
2a. Set r = 1. 

2b. Solve the spectral equations for P(u> r ). 

3. Reconstruct the e-machine from the P(ui r ). 

3a. Label candidate CSs by their length-r histories. 
3b. Estimate transition probabilities between states 

from sequence probabilities. 
3c. Merge histories with equivalent futures to form 

CSs. 

4. Generate CFs and diffraction spectrum from the 
e-machine. 

5. Calculate the error T(r) = \1Z — lZ err \ between 
the original and e-machine spectra: 

5a. If r(r) > r, replace r with r + 1 and go to step 2b; 
5b. Otherwise, stop. 



TABLE I: The eMSR algorithm. Here uT signifies the set of 
length-r sequences. 



we evaluate each over candidate /-intervals and choose an 
interval for e-machine reconstruction that gives figures- 
of- merit best in agreement with the theoretical values. 
These two constraints on the CFs imply that only two 
out of the first four correlation functions, Q c (l), Q a (l), 
Q c (2), and Q a (2) are independent. We choose to take 
the n = 2 terms as the independent parameters in the 
spectral equations. 

This completes our presentation of e-machine spectral 
reconstruction. The overall procedure is summarized in 
Tabled 



G. Measures of Structure and Intrinsic 
Computation 

There are a number of different quantities in compu- 
tational mechanics that describe the way information is 
processed and stored. (See Crutchfield and Feldman 78 
and Shalizi and Crutchfield 80 for a detailed discussion 
and mathematical definitions.) We consider only the fol- 
lowing. 

Memory Length rf. The value of r that results at the 
termination of eMSR is an estimate of the stacking pro- 
cess's memory length, denoted n, since it is the num- 
ber of MLs that one must use to optimally represent the 
process's sequence statistics (given the accuracy of the 
original spectrum). 

Statistical Complexity C^: The minimum average 
amount of memory needed to statistically reproduce a 



process is known as the statistical complexity C^. Since 
this a measure of memory, it has units of [Ms]. It is the 
Shannon information stored in the set of CSs: 

C M = -^PWl°g 2 PW > (20) 

<j£S 

where S is the set of CSs for the process and P(er) is 
the asymptotic probability of CS a. The latter is the 
left eigenvector, normalized in probability, of the state- 
to-state transition matrix T = X]se.4 T( s ). Physically, 
the statistical complexity is related to the average num- 
ber of previous spins one needs to observe on scanning 
the spin sequence to make an optimal prediction of the 
next spin. The statistical complexity is also related to 
a generalization of the stacking period for nonperiodic 
processes. We detail this connection in f0 

Entropy Rate /i M : The amount of irreducible randomness 
per ML after all correlations have been accounted for. It 
has units of [bits/ML]. It is also known as the thermo- 
dynamic entropy density in statistical mechanics and the 
metric entropy in dynamical systems theory. It is given 
by the average per-state uncertainty: 

k = E p w E T ^< lo & T ^ . ( 21 ) 

aes seA 

where a' is the CS reached from a upon seeing spin s. 
Physically, is a measure of the entropy density asso- 
ciated with the stacking process. 

Excess Entropy E: The amount of apparent memory in 
a process. The units of E are [bits]. It is defined as the 
amount of Shannon information shared between the left 
and right halves of a stacking sequence: 

E = VPHlog 2 (22) 
V P(-)P(-) 

Note that Crutchfield and Feldma n 78 ! 79 showed that, 
for range-r Markov processes, these quantities are related 
by 

Cfj, = E + rhfi . (23) 

For general nonfinite-range Markov processes, at present 
all that can be said is that the statistical complexity up- 
per bounds the excess entropy: E < 

III. e-MACHINE AND FAULT-MODEL 
STRUCTURAL ANALYSES 

Now that a statistical description of the stacking pro- 
cess has been found in the form of an e-machine, it is 
desirable to give an intuitive notion of what the struc- 
ture of the e-machine tells us about the patterns and 
disorder in a stacking process. In this section, we define 
and discuss architectural features of r — 3 e-machines and 
their relation to the FM. Specifically, we discuss the form 
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FIG. 1: The most general r = 3 e-machine. We show only 
the recurrent portion of the e-machine, as the transient part 
is not physically relevant (at this stage). The CSs are labeled 
by the last three spins seen, i.e. Sb means that 101 were the 
last three spins seen. The numbers in parentheses are the 
asymptotic CS probabilities. The edge label s\p indicates a 
transition on spin s with probability p. 

that growth, deformation, and layer-displacement fault- 
ing for the 2H and 3C structures assume on an r = 3 
e-machine. With this connection in place, we then ad- 
dress the general interpretation of e-machines as related 
to the stacking of CPSs and demonstrate the superiority 
of the e-machine as a general indicator of structure and 
disorder in CPSs. 



A. Causal-State Cycles 

1. Definitions 

Since the e-machinc reconstructed at r can distinguish 
at most only 2 r pasts, it can have no more than 2 r 
CSs. The most general reconstructed e-machine of mem- 
ory length r is topologically equivalent to a de Bruijn 
graph 98, of order r. By "most general" we mean that all 
length-r pasts are distinguished and all allowed transi- 
tions between CSs exist. Under these assumptions, the 
most general binary r — 3 e-machine (which has 2 3 = 8 
CSs and 2 3+1 = 16 transitions) is shown in Fig. ^ It 
is known that de Bruijn graphs can broken into a finite 
number of closed, nonself-intersecting loops called simple 
cycles (SGs)M 

By analogy, we define a causal-state cycle (CSC) as 
a finite, closed, nonself-intersecting, symbol-specific path 
on an e-machine. We denote a CSC by the sequence of 
CSs visited in square brackets [ ]. The states themselves 
are labeled with a number (in decimal notation) that 
gives the sequence of the last three spins leading to that 
CS. For example, for an r = 3 reconstructed e-machine, 
CS S3 means that Oil were the last three spins observed 
before reaching that CS. The period of the CSC is the 
number of CSs that comprise it. 

We further divide CSCs into strong and weak depend- 
ing on the strengths of the transitions between the CSs 
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TABLE II: The 19 CSCs on an r = 3 e-machine. In the 
first column, we give the CSC and in the second we show 
the stacking sequence in the Hagg notation implied by this 
CSC. If these CSCs are strongly represented on the e-machine, 
then we can interpret them as crystal structure. The corre- 
sponding crystal structures are shown in the third column in 
the Ramsdell notation*-- Some CSCs come in pairs related 
by spin-inversion symmetry!— i.e. [So] and [57] are both 3C 
structure, differing only in chirality. In cases where the Rams- 
dell notation is identical for different structures, we have at- 
tached a subscript to distinguish them. We list the period-8 
hexagonal structures with a subscript to differentiate them 
from the more common 8H structure (00001111). One must 
perform eMSR at r = 4 to discover this latter 8H structure. 



that make up the CSC. The causal-state cycle probability 
Pcsc is defined as the cumulative probability to com- 
plete one loop of a CSC, beginning on the CSC. We iden- 
tify CSCs with large Pcsc as strong CSCs and all others 
as weak CSCs. 



2. Structural Interpretations 

We begin by noting that a purely crystalline structure 
is simply the repetition of a sequence of MLs. This is re- 
alized on an e-machine as a CSC with a Pcsc — 1- That 
is, an e-machine consisting of a single CSC repeats the 
same state sequence endlessly, giving a periodic stacking 
sequence, which physically is some crystal structure. It 
is therefore useful to catalog all of the possible CSCs on 
an r = 3 e-machine, and this is done in Table ITU There 
are 19 CSCs on an r = 3 e-machine, and each can be 
thought of as a crystal structure if that CSC is strongly 
represented. (These should be verified by tracing them 
out on Fig.UJ) 

However, if a nearly perfect crystal has a few randomly 
inserted stacking errors, these "mistakes" are physically 
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an interruption of the regular ordering of MLs. That is, 
some error occurs, but after a relatively short distance the 
crystal returns to its regular stacking rule, thus restoring 
the crystalline structure. This is realized on an e-machine 
as a CSC with Pcsc (crystal) w 1 and another weakly 
represented CSC with Pcsc (fault) <C 1. In this way, we 
interpret weakly represented CSCs as faults. 

With this understanding in place, we note that an e- 
machine can quite naturally accommodate more than 
one crystal structure. Each such CSC must have a 
Pcsc (crystal) « 1, but they can be "connected" through 
a weak CSC, (Pcsc(fault) <C 1). However, to inter- 
pret two CSCs as crystalline structure, each must have 
Pcsc (crystal) « 1, and therefore necessarily they do not 
share a CS. (If they did, at least one CSC could not have 
a Pcsc (crystal) rs 1.) Similarly, e-machines can accom- 
modate more that one faulting structure. 

B. Faulting Structures on e-Machines 

As we did with crystal structures on an r = 3 e- 
machine, it is instructive to identify some of the more 
common faults on the most general r = 3 e-machine. 
We will consider only 2H and 3C structures with growth, 
deformation, and layer-displacement faults; but the ex- 
tension to other crystal and fault structures is straight- 
forward. We will only give the faulting structure on an 
e-machine to first order in the faulting probability, so that 
the basic graphical structure is clear. Thus, the connec- 
tion with the FM is valid only for weak faulting; which 
is consistent with the FM's domain of applicability. The 
e-machine, however, is valid for any degree of disorder, it 
is only the connection to the FM that is limited to weak 
faulting. 

We also note that the faults on the e-machine in this 
interpretation are such that the occurrence of two adja- 
cent faults is suppressed. If the probability of encoun- 
tering a fault on a ML is (say) p, then the probability 
of two adjacent faults is p 2 . That is, in our attempt to 
use the FM to interpret the structures captured by an e- 
machine, we ignore these higher-order terms. Thus, the 
issue of random versus nonrandom faulting in polytyp- 
ism is not addressed here. But, again, we note that the 
e-machine description quite naturally describes random, 
nonrandom, and periodic faulting structures^ 



1. Growth Faults 

Crystal growth often proceeds by a layer-addition pro- 
cess. Suppose a ML is added that cannot be thought of 
as a continuation of the previous crystal structure, but 
the MLs added subsequent to that ML return to the orig- 
inal stacking rule. Such a ML inserted into the sequence 
is called a growth fault. For the 2H structure, the rule 
is that the added ML has the same orientation as the 
next-to-last ML. For example, imagine an unfaulted 2H 




FIG. 2: Growth faults for the 2H structure as they appear 
on a r = 3 e-machine for small faulting probabilities a and b. 
The [S 2 S 5 ] CSC is the 2H structure and the CSCs [<S 5 <S 3 <S 6 ] 
and [<S2«S4iSi] give the faulting. This interpretation is only 
valid for small faulting probabilities. 

crystal, consisting of A and B MLs, is ...ABABAB.... 
Then a growth fault in this structure is a B ML followed 
by a C ML. The remaining MLs continue to follow the 
2H stacking rule, giving an overall stacking sequence such 
as 

...ABABABCBCBCB... , 

where underlining indicates the fault plane. 

Notice that the original crystal is composed of alternat- 
ing A and B MLs, while after the fault it becomes a se- 
quence of alternating B and C MLs. In terms of the Hagg 
notation, a growth fault for the 2H crystal corresponds 
to the insertion of a single or 1 into the spin sequence. 
For example, ...01010101... becomes ...010110101... upon 
insertion of a 1, where the underlining indicates the in- 
serted spin. 

This can be demonstrated on the e-machine shown in 
Fig. with small faulting probabilities a and b. This e- 
machine implies that the CSC represented by [S2S5] is 
dominant, which is simply the 2H structure. With small 
faulting probabilities a or 6, a or a 1, respectively, is 
randomly inserted into the 2H crystalline structure. 

In the 3C structure, the stacking rule is that the added 
ML is different from the previous two MLs. There are, 
of course, two distinct, symmetry-related 3C structures; 
one being the ...ABC ABC... and the other its spatial 
inversion ...CBACBA.... The spin sequences for these 
are ...1111... and ...0000..., respectively. A growth fault 
for this crystal gives a stacking sequence such as 

...ABC ABC B AC B A... , 

where underlining again indicates the fault plane. It is 
conventional to take the indicated ML as the fault plane 
since it is the only ML in the cubic stacking sequence 
that is hexagonally related to its neighbors. In terms of 
Hagg notation, the sequence is ...11111 1 00000..., where 
the vertical line indicates the fault plane. The effect of a 
growth fault in a 3C structure is then to switch from 3C 
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FIG. 3: Growth faults for the 3C structure on a r = 3 e- 
machine with small faulting probabilities a and 6. The CSCs 
[57] and [So] give the twinned 3C structure, while the paths 
S7S6S4S0 and 5o<Si<S3<S7 give the faulting. Here we have an 
example of a faulting structure that induces a transition be- 
tween two crystal structures. 




FIG. 5: The two possible e-machines for deformation faulting 
in the 3C structure with small faulting probabilities a and b. 
There are two e-machines, one for faulting from each of the 3C 
structures, [57] and [So]. They are disconnected, and hence 
faulting in the 3C structure of one chirality cannot cause the 
crystal to switch to another chirality. Hence this faulting 
mechanism does not cause twinning. 



The vertical bar indicates the plane across which the slip 
occurred. In the Hagg notation a deformation fault in 
the 2H structure is realized by flipping a spin. In this ex- 
ample, the unfaulted sequence ...10101010... transforms 
to ...101 1_10 10..., where again the underlined spin demar- 
cates the one flipped. The e-machine representation of 
this fault is shown in Fig. 0] 

In the 3C structure, deformation faults appear much 
the same. An example of a deformation fault in a 3C 
structure is 



FIG. 4: Deformation faulting for the 2H structure on a r — 3 
e-machine for small fault probabilities a and b. A deformation 
fault is represented by a single spin flip. 



structure of one chirality to another or to flip all spins 
after the fault plane. This fault is also known as a twin 
fault of the 3C structure, because it produces a crystal 
containing both kinds of 3C sequences. This growth fault 
is demonstrated in Fig. [3] with small faulting probabilities 
a and b. The two CSCs [S7] and [So] correspond to the 
two twinned 3C structures, with the transition sequences 
connecting them having a small total probability. 



2. Deformation Faults 

Other faults can occur after a crystal structure has 
been formed. Caused by external stresses or inhomoge- 
neous temperature distributions within the crystal, de- 
formation faults are the result of one plane in the crys- 
tal slipping past another in a direction transverse to the 
stacking. An example of deformation faulting in the 2H 
structure is the following: 

...A BAB I C AC AC A... . 



...ABC ABC\BC ABC A... 

The vertical bar again indicates the slip plane. Expressed 
in spin notation, the unfaulted 3C crystal ...11111111... 
becomes ...11110111..., a single spin flip. The two corre- 
sponding e-machines are shown in Fig. El 



3. Layer-Displacement Faults 

Layer- displacement faults are characterized by a shift- 
ing of one or two MLs in the crystal, while leaving the re- 
mainder of the crystal undisturbed. As such, these faults 
do not disrupt the long-range order present in a structure. 
They are thought to be introduced at high temperatures 
by diffusion of the atoms through the crystal^ In the 2H 
structure, an example of a layer-displacement fault is: 

...AB AB C BAB A... , 

where the underlined ML is the faulted layer. Written 
as spins, ...10101010... becomes ...10110010..., the under- 
lined spins indicating those that have flipped. A layer- 
displacement fault in the 2H structure is shown in Fig. [S] 
Layer-displacement faults in 3C structures are more 
difficult to realize, since each ML is sandwiched between 
two unlike MLs and changing its orientation violates the 
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FIG. 6: Layer-displacement faults for the 2H structure on a 
r — 3 e-machine with small faulting probability a. Here, for 
the sake of clarity, we show only faulting initiating from the 
<Ss CS, although a similar faulting structure can begin from 
the S 2 CS. 




FIG. 7: Layer-displacement faults for the 3C structure on a 
r — 3 e-machine with small faulting probability a. Again, 
we show only faulting from the 57 CS, corresponding to the 
...1111... spin sequence. A similar fault structure could be 
drawn for faulting from the So CS. A layer-displacement fault 
for the 3C structure is achieved by three consecutive spin flips. 
The difference between the fault structure here and that of a 
growth fault for the 3C structure is that growth faults produce 
twinning, whereas here the fault returns to the original crystal 
structure. 



stacking constraints. It is therefore necessary for two 
adjacent MLs to shift. Consequently, one expects that 
they are rarer. An example of layer-displacement in the 
3C structure is the following: 

...ABC ABC BAC ABC AB ... , 

where the underlined layers are faulted. The spin 
sequence changes from ...1111111... to one where 
three consecutive spins have been flipped to 0: 

1100011 .... A layer-displacement fault on the 3C 

structure is shown in Fig. \7\ 

These common faulting structures for the 2H and 3C 
crystals are given in Table ITTT1 along with the CSCs asso- 
ciated with them. 
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TABLE III: The more common fault structures for the 2H 
and 3C structures on an r — 3 e-machine. We make the 
following interpretation: If there is one parent structure (2H 
or 3C) that is strongly represented and a single additional 
CSC is associated with it as shown above, then we say that 
that crystal has the structure of the parent crystal with a 
certain amount of the given faulting. We should be clear here, 
however, not to confuse structure with mechanism. In this 
interpretation, the e-machine gives the structure that a crystal 
would have if it experienced a small amount of the faulting 
given. Structure does not necessarily imply mechanism. 



C. e-Machine Decomposition and General 
Interpretation 

The previous discussion has emphasized the important 
role that CSCs play in reflecting stacking structures — 
crystalline and fault. We have found that CSCs directly 
correspond to crystal and fault structures. The question 
then arises, can any arbitrary r — 3 e-machine be decom- 
posed into crystal and fault structures? We first note that 
the only difference between fault and crystal structure is 
the magnitude of the Pcsc associated with each CSC. 
It seems reasonable, then, to break an e-machine into a 
sum of CSCs. So we formally write, 

£~^Vi(CSCi) , (24) 

i 

where £ is the e-machine, Vi is the fraction of the e- 
machine that can be attributed to the i th CSC, and CSCi 
is the i th CSC. The most general binary r = 3 e-machine 
can be specified by eight variables. It is known, how- 
ever, that there are 19 CSCs on such an e-machinei2& 
So, unless there is a fortuitous vanishing of either CSs 
or e-machine transitions, or the imposition of additional 
constraints, the decomposition in Eq. I|24(l is not unique 
and therefore of questionable use. We note this situation 
is not expected to improve with larger r. The number of 
parameters on the r th e-machine grows exponentially in 
r, while the number of CSCs appears to increase as the 
exponential of an exponential in r& Thus, in general, 
it is not possible to decompose an e-machine into CSCs 
uniquely. 

The main purpose of such a decomposition is to provide 
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intuition into the structure present and possibly insight 
into the physical mechanisms that may have led to a 
particular structure. In this limited capacity Eq. Ij24(l 
may be helpful. We stress that Eq. (|24[1 has no other 
use than this and certainly cannot be used to calculate 
physical quantities. Only the entire e-machine is suitable 
for such calculations. 

How, then do we intuitively understand structure on 
an e-machine? In one picture — the weak faulting limit — 
we view the e-machine as a collection of CSCs. The de- 
composition given by Eq. (|24|) . while not unique, may 
be sensible. In this case, we have the same interpreta- 
tion as the FM. However, the e-machine has a broader 
range of applicability. It can, for instance, accommodate 
more than one crystal structure and detail how the stack- 
ing alternates between the two. The FM, to our knowl- 
edge, admits no such multicrystalline structures. The 
e-machine provides a more detailed account of multiple 
faulting structures. The FM, too, can reflect more than 
one kind of faulting structure, but the description is a 
stochastic one. The e-machine has no difficulty in reflect- 
ing any stacking structures, even closely spaced faulting 
structures. Thus, we retain the structural interpretations 
of £1111 A 21 for the case of weakly faulted structures. 

In other circumstances, when a sensible decomposition 
of the e-machine into crystal and faulting structures is 
not possible, it still gives insight into important stacking 
sequences and their spatial relations. Although it is no 
longer as advantageous to view the e-machine as a collec- 
tion of CSCs, we note that stacking sequence probabili- 
ties are readily observed on the e-machine either through 
direct calculation — the e-machine specifies sequence fre- 
quencies of any length — or, more simply for shorter se- 
quences, by asymptotic CS probabilities. The likelihood 
of seeing two sequences in close proximity can be found 
by tracing the appropriate path through the e-machine. 
Since the e-machine is valid for any degree of disorder, 
we can find the relative importance of stacking sequences 
for even heavily faulted crystals or for crystals in which 
no regular stacking structures exist. The architecture of 
the e-machine — i.e., the number, arrangement, and con- 
nections between the CSs, then provides an intuitive in- 
terpretation for the complexity and organization of the 
structure. One sees how the various stacking structures 
are related and how one blends into another upon scan- 
ning the crystal. 

Thus, in addition to providing a formidable calcula- 
tional tool, the e-machine provides a new way of view- 
ing structure in layered materials; it is not tethered to 
the assumption of a parent crystal permeated with weak 
faults. It gives a generalized way to view and compare 
the structure of different crystals, even when they have 
different — or no — parent structures. This should prove 
especially helpful in understanding solid-state transfor- 
mations in layered materials, as we intend to demonstrate 
in follow-on work. 

Finally, we note that these are interpretations of con- 
venience, not necessity. The e-machine is a unique de- 



scription of the stacking process, and thus any quantities 
that depend directly on a statistical description of the 
stacking are amenable to calculation. 

For those instances where a sensible decomposition of 
the e-machine is possible — i.e., the weak faulting limits — 
we employ Eq. (|24|l for the limited purpose of providing 
an intuitive understanding of the disordered structure We 
will call Vi either the fraction of crystal structure or, for 
weak CSCs, the fault density. This is, of course, different 
from the fault probability generally used in the literature. 
The fault probability is the frequency, upon scanning the 
stacking sequence, that one finds a particular fault in the 
sequence. 



IV. EXAMPLE PROCESSES 

We consider four prototype processes to demonstrate 
eMSR. In Example A, we reconstruct an e-machine for a 
known r = 3 process and show that the technique works 
in this case and, indeed, for any process that has n < 
3. In Example B, we consider a process that cannot be 
represented on a r — 3 e-machine. We reconstruct an 
e-machine for a process that requires a n = 4 memory 
length and find that a r = 3 e-machine gives a reasonable 
approximation. In Example C, we treat a process with 
H = 1 to demonstrate that eMSR does terminate at the 
minimum r. We also show that had we not terminated 
reconstruction at r — 1, the equivalence relation, Eq. 
(|llf) would require the merging of equivalent histories 
that would effectively find the r = 1 e-machine. Finally, 
in Example D, we reconstruct the r = 3 e-machine for the 
even process — a finite-state process that has a distinctive 
kind of infinite memory>22iiffi Again, we find that the 
r = 3 e-machine gives a reasonable approximation. 

In Examples A, B, and D, we solve the spectral equa- 
tions at r — 3 with the memory-length reduction ap- 
proximation via a Monte Carlo technique.— These equa- 
tions are given in Appendix ^ To find the predicted 
CFs for each e-machine, we take a sample spin sequence 
generated by the e-machine of length 400 000 and find 
the CFs by directly scanning the resulting stacking se- 
quence. The diffraction spectra are calculated from Eq. 
||TJ using a sample of 10 000 MLs. Since these are the- 
oretical spectra, and have no error, we are not able to 
set an acceptable threshold error in advance. Instead, 
we have chosen examples, except for Example C, that re- 
quire the r = 3 solutions and we solve these examples at 
r = 3. In the event that a CS is assigned an asymptotic 
state probability of less than 0.01, we take that CS to be 
nonexistent. 

We also calculate the information-theoretic quantities 
described in £111 Gl for each example and the reconstructed 
e-machine and display the results in Table IvTTl Analyz- 
ing these examples not only demonstrates the feasibil- 
ity and accuracy of spectral e-machine reconstruction, 
but they also serve to illustrate how e-machines capture 
structure and disorder. In the companion sequel^ we 
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FIG. 8: The r = 3 e-machine for the Example A process. The 
asymptotic probabilities are given for each CS. The large CSC 
probabilities for the [S 7 ] CSC {Pcsc([S 7 ]) = 0.92) and the 
[S 2 S 5 ] CSC (Peso [S2S5]) = °- 81 ) suggest that one think of 
these cycles as crystal structure and everything else as fault- 
ing. 
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FIG. 9: A comparison between the diffraction spectra 
generated by Example A and by the r — 3 spectrally recon- 
structed e-machine. The differences between the diffraction 
spectra for Example A and the r — 3 reconstructed e-machine 
are too small to be seen. We calculate 1Z = 2%, but this 
is largely due to numerical error. (See text.) The peak at 
I m 1/3 corresponds to the 3C structure and the two peaks at 
I w 1/2 and ifslto the 2H structure. 

apply the same procedures to the analysis of experimen- 
tal diffraction spectra from ZnS, focusing on the novel 
physical and material properties that can be discovered 
with this technique. 

A. Example A 

We begin with the sample process given in Fig. EI This 
process can approximately be decomposed into FM struc- 
tural components using Eq. 1)24(1 in the following way: 

2H 54% 

3C+ 24% 

Deformation fault 16% 

Growth fault 6% 



where the "+" on 3C indicates that only the positive 
chirality (...1111...) structure is present. The faulting is 
given with reference to the 2H crystal. 

The diffraction spectrum from this process is shown 
in Fig. The experienced crystallographer has little 
difficulty guessing the underlying crystal structure: the 
peaks at I w 1/2 and at I w 1 suggest the 2H structure; 
while the peak at I f» 1/3 is characteristic of the 3C 
structure. 

The mechanism responsible for the faulting is less clear, 
however. It is known that various kinds of fault pro- 
duce different effects on the Bragg peaks.— For instance, 
both growth and deformation faults broaden the peaks 
in the diffraction spectrum of the 2H structure, the dif- 
ference being that growth faults broaden the integer-/ 
peaks three times more than the half-integer-/ peaks, 
while broadening due to deformation faulting is about 
equal. The FWHM for the peaks are 0.028, 0.034, and 
0.049 for I w 0.33, 0.5, and 1, respectively. This gives 
then a ratio of about 1.4 for the integer-? to half- integer-/ 
broadening, suggesting (perhaps) that deformation fault- 
ing is prominent. One expects there to be no shift in the 
position of the peaks for either growth or deformation 
faulting; which is clearly not the case here. In fact, the 
two peaks associated with the 2H structure at I w 0.5 
and 1 are shifted by Al w 0.006 and 0.009, respectively. 
This analysis is, of course, only justified for one parent 
crystal in the overall structure, nonetheless if we neglect 
the peak shifts, the simple intuitive analysis appears to 
give good qualitative results here. 

With the 3C peak, both deformation and growth faults 
produce a broadening, the difference being that the 
broadening is asymmetrical for the growth faults. One 
also expects there to be some peak shifting for the defor- 
mation faulting. There is a slight shift (Al « 0.002) for 
the I 1/3 peak and the broadening seems (arguably) 
symmetric, so one is tempted to guess that deformation 
faulting is important here. Indeed, the CSC [S7S6S5S3] 
is consistent with deformation faulting in the 3C crys- 
tal. Heuristic arguments, while not justified here, seem 
to give qualitative agreement with the known structure. 

The e-machine description does better. We follow 
the spectral reconstruction procedure given in SjU] We 
Fourier analyze the spectrum over the interval < I < 1 . 
The figures-of-merit are equal to their theoretical val- 
ues within numerical error. The reconstructed e-machine 
is equivalent to the original one, with CS probabilities 
and transition probabilities typically within 0.1% of their 
original values, except for the transition 1|0.33 from £4, 
which was 1% too small. Not surprisingly, the process 
shown in Fig. [S] is the reconstructed e-machine and so we 
do not repeat the figure. 

The two-layer CFs Q s (n) versus n from the process and 
from the reconstructed e-machine are shown in Fig. 1101 
The differences are too small to be seen on the graph. We 
calculate the profile 7£-factor to compare the "experimen- 
tal" spectrum (Example A) to the "theoretical" spectrum 
(e-machine) and find a value of 72. ~ 2%. If we generate 
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FIG. 10: A comparison of the CFs Q s (n) between the Ex- 
ample A process and the r = 3 reconstructed e-machine. As 
with the diffraction spectra, the differences are too small to 
be seen on the graph. As an aid to the eye, here and in other 
graphs showing CFs, we connect the the values of adjacent 
CFs with straight lines. The CFs, of course, are defined only 
for integer values of n. 



several spectra from the same process, we find profile 1Z- 
factors of similar magnitude. This error is then must be 
due to sampling. It stems from the finite spin sequence 
length we use to calculate the CFs and our method for 
setting them equal to their asymptotic value. (See ^IIBI ) 
This can be improved by taking longer sample sequence 
lengths and refining the procedure for setting the CFs 
to their asymptotic value. Since typical profile 72-factors 
comparing theory and experiment are much larger than 
this, at present, this does not seem problematic. A com- 
parison of the two spectra is shown in Fig. @. This kind 
of agreement is typical of spectral e-machine reconstruc- 
tion from any process that can be represented as a r = 3 
e- machine 

We find by direct calculation from the e-machine that 
both Example A and the reconstructed process have a 
configurational entropy of as 0.44 bits/spin, a statisti- 
cal complexity of C M as 2.27 bits, and an excess entropy 
of Ew 0.95 bits. 

Since the original process was representable as an r = 
3 e-machine, this first example is largely a consistency 
check on eMSR. In the next example, we treat an r > 3 
process not representable by the r — 3 e-machines that 
we reconstruct. 



B. Example B 

Upon annealing, a solid-state transformation in ZnS 
from the 2H structure to either the 3C or 6H structures 
is possible, sometimes both occurring in different parts of 
the same crystal^ However, two crystal structures repre- 
sented with an e-machine cannot share a CS, as discussed 
in £1111 A 21 On a r = 3 e-machine, for example, both the 
CSCs associated with the 3C and the 6H structures share 




FIG. 11: The process for Example B. Since it has a mem- 
ory of n = 4, we label the states with the last four spins 
observed: i.e., 72-12 means that 1100 were the last four spins. 
The CSCs [Tils] and [TZo] give rise to 3C structure and the 
CSC [7£i72.37^77£i47?.i27?,8] generates 6H structure. 




FIG. 12: The e-machine reconstructed at r — 3 for Example 
B. 



the CSs 57 and So, so a crystal containing both struc- 
tures cannot be properly modeled at r = 3. In fact, it is 
necessary to use an r — 4 e-machine to encompass both 
structures. So, to see how well spectral reconstruction 
works at r = 3 for an r = 4 process, we consider the pro- 
cess shown in Fig. [TT] The CSC [TZxTZ^TZnTZ^TZs] 
would give rise to 6H structure if it were a strong CSC, 
but we find that Pqh = 0.25. We say then that this is 
mild 6H structure. The CSCs [TZo] and [TZ15] give the 
twinned 3C structures. 

The figures of merit were all equal to their theoretical 
values within numerical error. Employing spectral recon- 
struction, we find the r = 3 e-machine shown in Fig. 1121 
All CSs are present and all transitions, save those that 
connect the S2 and S5 CSs, are present. A comparison 
of the CFs for the original process and the reconstructed 
e-machine is given in Fig. 1131 The agreement is remark- 
ably good. It seems that the r — 3 e-machine picks up 
most of the structure in the original process. 

There is similar, though not as good, agreement in the 
diffraction spectra, as Fig. 1141 shows. The most notable 
discrepancies are in the small rises at I as 1/6 and I as 5/6. 
We calculate a profile 7\L-factor of 72. = 12% between the 
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FIG. 13: A comparison of the CFs Q s (n) generated by the 
r = 3 reconstructed e-machine (dashed line) and generated 
by Example B (solid line). The agreement is excellent. 



diffraction spectra for Example B and the reconstructed 
e-machine. The r — 3 e-machine has difficulty reproduc- 
ing the 6H structure in the presence of 3C structure, as 
expected. 

Given the good agreement between the correlation 
functions and the spectra generated by Example B and 
the r — 3 e-machine, we are led to ask what the dif- 
ferences between the two are. In Table IIVI we give the 
frequencies of the eight length-3 sequences generated by 
each process. The agreement is excellent. They both give 
the same probabilities for the most common length-3 se- 
quences, 111 and 000. Example B does forbid two length- 
3 sequences, 101 and 010, which the reconstructed r = 3 
e-machine allows with a small probability (0.03). At the 
level of length-3 sequences, the e-machine is capturing 
most of the structure in the stacking sequence. 

A similar analysis allows us to compare the probabili- 
ties of the 16 length-4 sequences generated by each; the 
results are given in Table [V] There are more striking dif- 
ferences here. The frequencies of the two most common 
length-4 sequences in Example B, P(llll) = P(0000) = 
0.23, are overestimated by the r — 3 e-machine, which 
assigns them a probability of 0.29 each. Similarly, se- 
quences forbidden by Example B — 1101, 1011, 1010, 
1001, 0110, 0101, 0100, 0010— are not necessarily forbid- 
den by the r = 3 e-machine. In fact, the r = 3 e-machine 
forbids only two of them, 0101 and 1010. This implies 
that r — 3 e-machine can find spurious sequences that 
are not in the original stacking sequence. This is to be 
expected. But the r = 3 e-machine does detect impor- 
tant features of the original process. It finds that this is 
a twinned 3C structure. It also finds that 2H structure 
plays no role in the stacking process. (We see this by 
the absence of transitions between the £2 and £5 CSs in 
Fig. El) 

One can also attempt to decompose the r = 3 e- 
machine into a sum of CSCs and interpret this as crystal 
and fault structure. However, as is typically the case, 
there is no unique decomposition and so therefore such an 
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FIG. 14: A comparison of the diffraction spectra between 
r = 3 reconstructed e-machine and the process of Example 
B. The agreement is surprisingly good. The small peaks at 
Z ~ 1/6 and I « 5/6 correspond to the 6H structure. The 
r = 3 e-machine has difficulty in reproducing these because 
the 6H and the 3C structure both share the S7 and So CSs 
and so require an e-machine reconstructed at r = 4 to properly 
disambiguate them. 



exercise is of questionable validity. With the exception 
of the sequences 1111 and 0000, the other twelve non- 
vanishing sequences all appear with a small, but rather 
constant probability of 0.03 or 0.05. One possible inter- 
pretation is to say that the CSCs [So] and [S7] contribute 
to 3C structure with a weight of 0.58. We could further 
interpret the [S7S6S5S3] and [6>o6>i<S>2<S4] CSCs as defor- 
mation faulting of the 3C structure giving a combined 
weight of 0.24. And finally, we could associate the CSC 
[£>i6>36>6<S4] with 4H structure. This last interpretation of 
the CSC [<Si<S3<S6<S4] with any crystal structure is trouble- 
some as the Pcsc <C 1. Another possible decomposition 
would be to again assign the CSCs [57] and [57] to the 
3C structure with a weight of 0.58, to interpret the paths 
SiSqS^Sq and S0S1S3S7 as as twin faulting with a prob- 
ability weight of 0.18, treat the CSC [5i5 3 5 6 5 4 ] as 4H 
structure, and finally to interpret the two CSCs [<Si<S2<S4] 
and [S31S61S5] as 9R structures. These two descriptions 
are clearly rather different and, arguably, have no use in 
any account, other than serving to illustrate the ambigu- 
ity of FM-like structural interpretations. 

In addition to the nonuniqueness difficulties, by sim- 
ply listing the probability density of the various crys- 
tals and fault structures, we say nothing about how one 
crystal converts into another as one scans the stacking 
sequence. This exercise demonstrates the impoverished 
view of crystal structure inherent in the FM. In short, 
the stacking sequence implied by the e-machine in Fig. 1121 
comes from a physical structure that is not describable 
in terms of the FM. 

We find by direct calculation that the Example B pro- 
cess has a configurational entropy of — 0.51 bits/spin, 
a statistical complexity of C M = 2.86 bits, and an excess 
entropy of E = 0.82 bits. The reconstructed process gives 
similar results with a configurational entropy = 0.54 
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Sequence Example B 


elVLbxi 


Sequence 


Example B 




111 0.32 


0.32 


Oil 


0.09 


0.07 


110 0.09 


0.07 


010 


0.00 


0.03 


101 0.00 


0.03 


001 


0.09 


0.08 


100 0.09 


0.08 


000 


0.32 


0.32 



TABLE IV: The frequencies of length-3 sequences obtained 
from the Example B and the e-machine reconstructed at r — 
3. 



Sequence Example B 


eMSR 


Sequence Example B 


eMSR 


1111 


0.23 


0.29 


0111 


0.09 


0.03 


1110 


0.09 


0.03 


0110 


0.00 


0.04 


1101 


0.00 


0.03 


0101 


0.00 


0.00 


1100 


0.09 


0.04 


0100 


0.00 


0.03 


1011 


0.00 


0.03 


0011 


0.09 


0.05 


1010 


0.00 


0.00 


0010 


0.00 


0.03 


1001 


0.00 


0.05 


0001 


0.09 


0.03 


1000 


0.09 


0.03 


0000 


0.23 


0.29 



TABLE V: The frequencies of length-4 sequences obtained 
from the Example B and the e-machine reconstructed at r — 
3. 



bits/spin, a statistical complexity of — 2.44 bits, and 
an excess entropy of E = 0.83 bits. 



C. Example C 

We treat this next system, Example C, to contrast it 
with the last and to demonstrate how pasts with equiva- 
lent futures are merged to form CSs. The e-machine for 
this system is shown in Fig. ll5l and is known as the golden 
mean process. The rule for generating the golden mean 
process is simply stated: a or 1 are allowed with equal 
probability unless the previous spin was a 0, in which 
case the next spin is a 1. Clearly then, this process needs 
to only remember the previous spin, and hence it has a 
memory length of r = 1. It forbids the sequence 00 and 
all sequences that contain this as a subsequence. The 
process is so-named because the total number of allowed 
sequences grows with sequence length at a rate given by 
the golden mean = (1 + \/5)/2. 

01 1/2 




1 I 1 



FIG. 15: The recurrent portion of the e-machine for the 
golden mean process, Example C. The process has a memory 
length of r = 1, and so we label each CS by the last spin seen. 
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FIG. 16: A comparison of the CFs Q a (n) generated by the 
r — 1 reconstructed e-machine and the golden mean process 
of Example C. The CFs decay quickly to their asymptotic 
value of 1/3. 
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FIG. 17: A comparison of the diffraction spectra for Example 
C and the reconstructed r — 1 e-machine. The agreement 
is excellent. One finds a profile 1Z- factor of 2% between the 
"experimental" spectrum, Example C, and the "theoretical" 
spectrum calculated from the reconstructed e-machine. 



We employ the eMSR algorithm and find the e-machine 
given (again) in Fig. 1151 at r = 1. A comparison of the 
CFs from Example C and the golden mean process are 
given in Fig. ^j] The differences are too small to be seen. 
We next compare the diffraction spectra, and these are 
shown in Fig. El We find excellent agreement and cal- 
culate a profile 7?.-factor of 1Z — 2%. At this point eMSR 
should terminate, as we have found satisfactory agree- 
ment (to within the numerical error of our technique) 
between "experiment", Example C, and "theory", the 
reconstructed e-machine. 

Let us suppose that instead, we increment r and fol- 
low the eMSR algorithm as if the agreement at r = 1 had 
been unsatisfactory. In this case, we would have gener- 
ated the "e-machine" shown in Fig. ^] at the end of step 
3b. We have yet to apply the equivalence relation Eq. 
(|llf> and so let us call this the nonminimal e-machine. 
That is, we have not yet combined pasts with equivalent 
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finite history of previous spins, we have labeled them <S"< 
and 5 od d- 



FIG. 18: The r — 2 reconstructed nonminirnal e-machine for 
the golden mean process, Example C. 



futures to form CSs, step 3c. Let us do that now. 

We observe that the state 52 is different from the other 
two, iSi and S3, in that one can only see the spin 1 upon 
leaving this state. Therefore it cannot possibly share the 
same futures as Si and S3, so no equivalence between 
them is possible. However, we do see that P(l|Si) = 
P(1|S 3 ) = 1/2 and P(0|S x ) = P(0|<S 3 ) = 1/2 and, thus, 
these states share the same probability of seeing futures 
of length- 1. More formally, we can write 



-(») _ T ( 



01-»ls 



1 • 



(25) 



Since we are labeling the states by the last two symbols 
seen at r = 2, within our approximation they do have 
the same futures and thus Si and S3 can be merged to 
form a single CS. The result is the e-machine shown in 
Fig. ES1 

In general, in order to merge two histories, we check 
that each has an equivalent future up to the memory 
length r. In this example, we need only check futures 
up to length- 1, because after the addition of one spin (s) 
each is labeled by the same past, namely Is. Had we 
tried to merge the pasts 11 and 10, we would need to 
check all possible futures after the addition of two spins, 
after which the states would have the same futures (by 
assumption). That is, we would require 
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(26) 



(27) 



for all s,s'. 

We find by direct calculation from the e-machine that 
the both Example C and the reconstructed process have 
a configurational entropy of « 0.67 bits/spin, a statis- 
tical complexity of C M ~ 0.92 bits, and an excess entropy 
of E w 0.25 bits. 
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FIG. 20: The r = 3 reconstructed e-machine for the even 
process of Example D. Since the even process forbids the se- 
quences {01 2fc+1 0, k = 0, 1, 2, . . .} and all sequences contain- 
ing them, it is satisfying to see that 010 is forbidden by the 
reconstructed e-machine, as evidenced by the missing S2 CS. 



D. Example D 

We now consider a simple finite-state process that 
cannot be represented by a finite-order Markov process, 
called the even process^^l^ as the previous examples 
could. The even language^SLiS^ consists of sequences 
such that between any two 0's either there are no Is or 
an even number. In a sequence, therefore, if the imme- 
diately preceding spin was a 1, then the admissibility of 
the next spin requires remembering the evenness of the 
number of previous consecutive Is, since seeing the last 
0. In the most general instance, this requires an indef- 
initely long memory and so the even process cannot be 
represented by any finite-order Markov chain. 

We define the even process as follows: If a or an even 
number of consecutive Is were the last spin(s) seen, then 
the next spin is either 1 or with equal probability; other- 
wise the next spin is 1. While this might seem somewhat 
artificial for the stacking of simple polytypes, one cannot 
exclude this class of (so-called sofic) structures on phys- 
ical grounds. Indeed, such long-range memories may be 
induced in a solid-state phase transformations between 
two crystal structures ,i2iii2i It is instructive, therefore, 
to explore the results of our procedure on processes with 
such structures. 
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FIG. 21: A comparison of the CFs Q s (n) generated by the r = 
3 reconstructed e-machine and the even process of Example 
D. The CFs decay quickly to their asymptotic value of 1/3. 



Additionally, analyzing a sofic process provides a valu- 
able test of eMSR as practiced here. Specifically, we 
invoke a finite-order Markov approximation for the so- 
lution of the r = 3 equations, and we shall determine 
how closely this approximates the even process with its 
effectively infinite range. 

The e-machine for this process is shown in Fig. Its 
causal-state transition structure is equivalent to that in 
the e-machine for the golden mean process. They differ 
only in the spins emitted upon transitions out of the Si 
CS. It seems, then, that this process should be easy to 
detect. 

The result of e-machine reconstruction at r — 3 is 
shown in Fig. [201 Again, it is interesting to see if the 
sequences forbidden by the even process are also forbid- 
den by the r = 3 e-machine. One finds that the sequence 
010 — forbidden by the process — is also forbidden by the 
reconstructed e-machine. This occurs because CS S2 is 
missing. We do notice that the reconstructed e-machine 
has much more "structure" than the original process. We 
now examine the source of this additional structure. 

Let us first contrast differences between eMSR and 
other e-machine reconstruction techniques, taking the 
subtree-merging method (SMM) of Crutchficld and 
Young 7 &££ as the alternative prototype. There are two 
major differences. First, since here we estimate sequence 
probabilities from the diffraction spectra and not a sym- 
bol sequence, we find it necessary to invoke the memory- 
length reduction approximation at r > 3 to obtain a com- 
plete set of equations. Specifically, we assume that only 
histories up to range r are needed to make an optimal 
prediction of the next spin. Second, we assume that we 
can label CSs by their length-r history. 

We can test these assumptions in the following way. 
For the first, we compare the frequencies of length-4 se- 
quences obtained from each method. This is shown in 
Table fvTI The agreement is excellent. All sequence fre- 
quencies are within ±0.01 of the correct values. The 
small differences are due to the memory-length reduc- 



tion approximation. So this does have an effect, but it is 
small here. 

To test the second assumption, we can compare the 
e-machines generated from each method given the same 
length-4 sequence probabilities. Doing so, SMM gives the 
e-machine for the even process shown in Fig. EH eMSR 
gives a different result. After merging pasts with equiva- 
lent futures, one finds that shown in Fig. [53 For clarity, 
we explicitly show the length-3 sequence histories associ- 
ated with each CS, but do not write out the asymptotic 
state probabilities. 

The e-machine generated by eMSR is in some respects 
as good as that generated by SMM. Both reproduce the 
sequence probabilities up to length-4 from which they 
were estimated. The difference is that for eMSR, our 
insistence that histories be labeled by the last r-spins 
forces the representation to be Markovian of range r. 
Here, a simpler model for the process, as measured by 
the smaller statistical complexity (0.92 bits as compared 
to 1.92 bits), can be found. So the notion of minimal- 
ity is violated. That is, eMSR searches only a subset 
of the space from which processes can belong. Should 
the true process lie outside this subset (Markovian pro- 
cesses of range r), then eMSR returns an approximation 
to the true process. The approximation may be both 
more more complex and less predictive than the true pro- 
cess. It is interesting to note that had we given SMM the 
sequence probabilities found from the solutions of the 
spectral equations, we would have found, (within some 
error) the e-machine given in Fig. 1191 

We find, then, that there are two separate conse- 
quences to applying eMSR that affect the reconstructed 
e-machine. The first is that for r > 3, the memory- 
length reduction approximation must be invoked to ob- 
tain a complete set of equations. This approximation 
limits the histories treated and can affect the values esti- 
mated for the sequence probabilities. The second is the 
state-labeling scheme. Only for Markovian (non-sofic) 
processes can CSs be labeled by a unique finite history. 
Making this assumption effectively limits the class of pro- 
cesses one can detect to those that are block-r Markovian. 
To see this more clearly, we can catalog the possible his- 
tories that lead to the two CSs in Fig. ^3 In doing so, we 
find that the histories 000, 011, 110, 100, and 100 always 
leave the process in CS Seven- Similarly, the histories 001 
and 101 always leave the process in CS <S dd- But having 
seen the history 111 does not specify the CS as one can 
arrive in both CSs from this history. So the labeling of 
CSs by histories fails here. 

Then why do we not find sequence probabilities by 
solving the spectral equations and then use SMM to re- 
construct the e-machine? There are two reasons. The 
first is that in general one must know sequence proba- 
bilities for longer sequences than is necessary for eMSR. 
Solving the spectral equations for these longer sequence 
frequencies is onerous. The second is that error in the 
sequence probabilities found from solving the spectral 
equations for these longer sequences makes identifying 
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Sequence eMSR SMM 


Sequence 


eMSR 


SMM 


1111 


0.24 


0.25 


0111 


0.08 


0.09 


1110 


0.09 


0.08 


0110 


0.07 


0.08 


1101 


0.09 


0.08 


0101 


0.00 


0.00 


1100 


0.08 


0.08 


m fin 
0100 


< U.Ol 


U.00 


1011 


0.08 


0.08 


0011 


0.08 


0.08 


1010 


0.00 


0.00 


0010 


< 0.01 


0.00 


1001 


0.04 


0.04 


0001 


0.05 


0.04 


1000 


0.04 


0.04 


0000 


0.04 


0.04 



TABLE VI: The frequencies of length-4 sequences obtained 
from eMSR and SMM for the even process. At most, they 
differ by ±0.01. 




FIG. 22: The e-machine inferred from the exact sequence 
frequencies. The causal states are labeled with the (possibly 
several) histories that can lead to them. 



equivalent pasts almost impossible. The even process is 
an exception here, since one needs to consider only fu- 
tures of length 1 . This is certainly not the case in general. 

Having explored the differences between eMSR and 
SSM, we now return to a comparison of the results of 
each method. A comparison of the CFs for the even pro- 
cess and the reconstructed e-machine is given in Fig. 1211 
We see that both decay quite quickly to their asymp- 
totic values of 1/3. There is good agreement, except in 
the region between 5 < n < 10. Examining the diffrac- 
tion spectra in Fig. |22l we see that there is likewise good 
agreement except in the region 0.7 < I < 0.9. The pro- 
file TZ- factor between the two spectra is TZ ~ 8%, which 
indicates that there is reasonable agreement. 

There is a curious isolated zero in the process's spec- 
trum at I = 5/6. The other interesting feature is the 
broad peak at I w 1/3. One might guess that this origi- 
nates from some 3C + structure and, indeed, glancing at 
the reconstructed e-machine of Fig. [20| shows that the 
CSC [Sf] is strongly represented. The faulting is less 
clear. We would expect, though, that presence of the 
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FIG. 23: A comparison between the diffraction spectra l(Z) 
generated by the r — 3 reconstructed e-machine and by the 
even process of Example D. The agreement is good except in 
the region 0.7 < / < 0.9. Notably, the diffraction spectra for 
the even process has an isolated zero at I = 5/6. 

CSC [S7S61S4S0S1S3] wou ld indicate layer-displacement 
faulting and the CSC [S7S6S5S3] supports this. 

We find by direct calculation from the even process 
that it has a configurational entropy of = 0.67 
bits/spin, a statistical complexity of C M = 0.92 bits, and 
an excess entropy of E = 0.91 bits. The reconstructed 
e-machine gives information-theoretic quantities that are 
rather different. We find a configurational entropy = 
0.80 bits, a statistical complexity of C M = 2.63 bits, and 
an excess entropy of E = 0.22 bits. 

One reason that the reconstructed e-machine gives CFs 
and diffraction spectra in such good agreement with the 
even process in spite of the fact that the information- 
theoretic quantities are different is the insensitivity of the 
CFs and diffraction spectra to the probabilities of long se- 
quences: Eq. © adds sequence probabilities to find CFs. 
The fact that the even process has such a long memory 
is masked by this. However, information-theoretic quan- 
tities are sensitive to the structure of long sequences, e- 
Machine reconstruction at r = 4 should prove interesting, 
in this light, since the even process picks up another for- 
bidden sequence — OHIO — and this additional structure 
would be reflected in the reconstructed e-machine. 

For comparison we list each of the example's 
information-theoretic properties in Table IVTT1 

E. Possible Difficulties with Applying eMSR 

We have given four examples that demonstrate suc- 
cessful applications of eMSR. We have found instances, 
however, when the eMSR has difficulties converging to a 
satisfactory result. We now analyze each step in eMSR 
as given in Table [I] and discuss possible problems that 
may be encountered. 

Step 1. Several problems can arise here. One is that the 
figures-of- merit, (3 and 7, are sufficiently different from 
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System 


Range 


ft M [bits/ML] 


C„ [bits] 


E [bits] 


A 


Example A 


3 


0.44 


2.27 


0.95 


0.00 


e- machine 


3 


0.44 


2.27 


0.95 


0.00 


Example B 


4 


0.51 


2.86 


0.82 


0.00 


€- machine 


q 
o 


U.o4 


O A A 


U.oo 


n m 
-U.U1 


Example C 


1 


0.67 


0.92 


0.25 


0.00 


e- machine 


1 


0.67 


0.92 


0.25 


0.00 


Example D 


oo 


0.67 


0.92 


0.91 




e- machine 


3 


0.79 


2.58 


0.21 


0.00 



TABLE VII: Measures of intrinsic computation calculated 
from the processes of Examples A, B, C, and D and their 
(r = 3) reconstructed e-machines. For Examples A, B, and 
C the reconstructed e-machines give good agreement. For 
Example D, however, the reconstructed e- machine requires 
more memory and still has a entropy density fe M significantly 
higher than that of the even process. The last column gives 
A = Cp.— E— r/i M as a consistency check derived from Eq. 1231 . 
which describes order-r Markov processes. Recall that the 
even process of Example D is not a finite-r process and so 
Eq. 1231 does not hold. All one can say is that E < C M , 
which is the case for Example D. 



their theoretical values over all possible /-intervals that 
eMSR should not even be attempted. Even if one does 
find an interval such that they indicate satisfactory spec- 
tral data, it is possible that the CFs extracted over this 
interval are unphysical. That is, there is no guarantee 
that all of the CFs are both positive and less than unity. 
In such a case, no stacking of MLs can reproduce these 
CFs. Finally, if error ranges have not been reported with 
the experimental data, it may not be possible to set the 
error threshold T. 

Step 2. The P(w r ) solutions to the spectral equations are 
not guaranteed to be either real or positive for r > 3. If 
this is so, then no physical stacking of MLs can reproduce 
the CFs from the spectrum. 

Step 3. Given P(w r ) that satisfy the elementary con- 
ditions of probability (i.e., there is no difficulty at step 
2), step 3 will return a machine that generates P(w r ). 
It is possible, however, that the resulting states are not 
strongly connected, and thus the result may not be inter- 
preted as a single e-machine. 

Step 4- There are no difficulties here. 

Step 5. It is possible that one is required to go to an r that 
is cumbersome to calculate. In this case, one terminates 
the procedure through practicality. 

We find that the roots of these difficulties can be ul- 
timately traced to four problems: (i) excessive error in 
the diffraction spectrum, (ii) the process has statistics 
that are too complex to be captured by a finite-range 
Markov process, (iii) the memory-length approximation 
is not satisfied, and (iv) the initial assumptions of poly- 
typism are violated. We are likely to discover (i) in step 
1. For (ii) and (iii), we find no difficulties at step 1, but 



rather at steps 2, 3, and 5. For (iv), we have not exam- 
ined this case in detail. However, we expect that if the 
assumptions of the stacking of MLs are not met, 

then since Eq. is no longer valid, the CFs found by 
Fourier analysis will not reflect the actual stacking prob- 
abilities. This will likely be interpreted as poor figures- 
of-merit, and eMSR will terminate at step 1. 

Of the three possible difficulties only (ii) and (iii) 
should be considered to be inherent to eMSR. It is sat- 
isfying that eMSR can detect errors in the diffraction 
spectrum and then stop, so that it does not generate an 
invalid representation that simply describes "error" or 
"noise" . 



V. CHARACTERISTIC LENGTHS IN CPSS 

We now return to one of the mysteries of polytypism, 
namely that of the long-range order which they seem to 
possess. It is of interest, then, to ask what, if anything, 
the spectrally reconstructed e-machine indicates about 
the range of interactions between MLs. In this section, we 
discuss and quantify several characteristic lengths that 
can be estimated from reconstructed e-machines. 

(i) Correlation Length, X c . From statistical mechan- 
ics, we have the notion of a correlation length, 105106 
which is simply the characteristic length scale over which 
"structures" are found. The correlation functions Q c {n), 
Q a (n), and Q s (n) are known to decay to 1/3 for many 
disordered stackings. (For some exceptions, see Kabra 
and PandeyjiSi Yi and CanrightjSi and VarniS 5 .) For the 
disordered cases considered here, exponential decay to 
1/3 seems to be the rule. We therefore define the corre- 
lation length A c as the characteristic length over which 
correlation information is lost with increasing separation 
n. More precisely, let us define ^f q (n) as 



*g(n) 



\Qa(n) 



(28) 



so that ^ q (n) gives a measure of the deviation of the CFs 
from their asymptotic value. Then we say that 



q q (n) = F(n) x 2- n/x " 



(29) 



where F(n) is some function of n. 

For those cases where the CFs do not decay to 1/3, we 
say that the correlation length is infinite. We find that 
exponential decay is not always obeyed, but it seems to 
be commonjiSi and the correlation length thus defined 
gives a useful measure of the rate of coherence loss as n 
increases. Our definition of correlation length is similar 
to the characteristic length L defined by Shrestha and 
PandeypffliS 

(ii) Recurrence Length, V . For an exactly periodic pro- 
cess, the period gives the length over which a template 
pattern repeats itself. We can generalize this for arbi- 
trary, aperiodic processes in the following way. Let us 
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take the recurrence length V as the geometric mean of 
the distances between visits to each CS weighted by the 
probability to visit that CS: 

V = I] 2? ' , (30) 

Si£S 

where Tj is the average distance between visits to a CS 
and pi is the probability of visiting that CS. Then, 

V = Y[ (2 lo ^ T *)P* 
s.es 

— J~J 2 _Pilog2Pi 
s z es 

_ 2~Ss.es P» l°g 2 P> 

= 2 C " , (31) 

where we have used the relation Tj = l/p*. 

For periodic processes, = log 2 V and so T 3 is simply 
a process's period. For aperiodic processes V gives a 
measure of the average distance over which the e-machine 
returns to a CS. Notice that this is defined as the average 
recurrence length in the Hagg notation. For cubic and 
rhombohcdral structures, for example, this is one-third 
of the physical repeat distance in the absolute stacking 
sequence. 

(Hi) Memory Length, r;. Recall from 311 Cl that the mem- 
ory length is an integer which specifies the maximum 
number of previous spins that one must know in the worst 
case to make an optimal prediction of the next spin. For 
an r th -order Markov process this is r. 

(iv) Interaction Length, Tj. The interaction length is an 
integer that gives the maximum range over which spin- 
spin interactions appear in the Hamiltonian. 

We calculated the A c , V, and n (in units of MLs) for 
Examples A-D as well as for three crystal structures. The 
results are displayed in Table IVIIII We see that each 
captures a different aspect of the system. The correla- 
tion length A c sets a scale over which a process is co- 
herent. For crystals, as shown in Table IVlTTl this length 
is infinite. For more disordered systems, this value de- 
creases. The generalized period V is a measure of the 
scale over which the pattern produced by the process re- 
peats. The memory length ri is most closely related to 
what we might think as the maximum range of "influ- 
ence" of a spin. That is, it is the maximum distance over 
which one might need to look to obtain information to 
predict a spin's value. 

For periodic, infinitely correlated systems spins at large 
separation carry information about each other, as seen in 
crystals. But this information is redundant. Outside a 
small neighborhood one gets no additional information 
by knowing the orientation a spin assumes. Notice that 
one can have an infinite memory length with a relatively 
small correlation length, as seen for the even system (Ex- 
ample D). That is, even though on average the knowledge 



System 


A c 


V 


n 


Example A, r = 3 


~ 7.4 


4.8 


3 


Example B, r = 4 


~ 7.8 


7.3 


4 


Example C, Golden Mean 


~ 4.5 


1.9 


1 


Example D, Even Process 


~ 1.7 


1.9 


oo 


3C 


oo 


1 





2H 


oo 


2 


1 


6H 


oo 


6 


3 



TABLE VIII: A comparison of the three characteristic lengths 
that one can calculate from knowledge of the e-machine: the 
correlation length A c , the recurrence length V , and the mem- 
ory length r;. 



one has about a spin may decay, there are still configu- 
rations in which distantly separated spins carry informa- 
tion about each other that is not stored in the intervening 
spins. 

If we know the e-machine for a process, then we can 
directly calculate A c , V, and r/. How, then, do these 
relate to the interaction length 77? Infinite correlation 
lengths can be achieved with very small 77 , as in the case 
of simple crystals. So correlation lengths alone imply 
little about the range of interactions. For a periodic sys- 
tem in the ground state, the configuration's period puts a 
lower bound on the interaction length via 77 > log 2 V — 
barring fine tuning of parameters, such as found at the 
multiphase boundaries in the ANNNI modelSl or those 
imposed by symmetry considerations^*^ This does not 
hold, however, for systems above the ground state. The 
most likely candidate for a useful relation between 77 and 
a quantity generated from the e-machine is 77. Again, r; 
sets a lower bound on 77, if the system is in the ground 
state. For polytypes, the multitude of observed struc- 
tures suggests that most are not in the ground state and, 
thus, one does not know what the relation between 77 and 
n is. It is conceivable, especially in the midst of a solid- 
state phase transition, that small 77 could generate large 
7";. Although, an e-machine is a complete description of 
the underlying stacking process, one should be able to 
calculate from it the interaction length in the case that 
the system not in the ground state. We suspect the an- 
swer lies in the different ways in which a Hamiltonian 
and an e-machine describe a material. 



VI. CONCLUSIONS 

We offer here a new way of treating planar disorder 
in CPSs. We find the minimal, unique description of 
the stacking for any amount and kind of ordered and 
disordered sequence that a material may contain. Wc 
demonstrated how this description — the e-machine — can 
be directly inferred from experimental diffraction spec- 
tra. eMSR uses all of the information in the spectrum, 
both Bragg-like and diffuse scattering. Our description 
is necessarily statistical, in that we do not find the spe- 
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cific stacking sequence that gave rise to the diffraction 
pattern, but rather a minimal description of the ensem- 
ble of stackings that could have generated the diffraction 
spectrum. We contend that this statistical description 
is the most useful form in which to express the struc- 
ture of the crystal. From it, physical parameters such as 
the entropy per ML, the statistical complexity, and the 
average stacking-fault energy for disordered stacking se- 
quences are calculable^ Indeed, could we have found a 
specific stacking sequence millions of MLs in length, one 
still would search for some way to compress this infor- 
mation into a useful form. In short, one would want to 
find its e-machine. We therefore state that our solution — 
which is directly determined from experimental diffrac- 
tion spectra — offers the most complete and compact un- 
derstanding of the stacking process and, thus, of the 
solid's structure. On this basis, we say that (i) eMSR 
has solved the problem of inferring structural information 
from diffraction spectra for CPSs and (ii) the resulting 
e-machine is the unique and minimal expression of that 
structure. 

We illustrated the approach by solving the inference 
problem for r = 3, which gives stacking-sequence prob- 
abilities up to length 4. Unlike previous work, we con- 
sidered all the possible sequences of this length without 
invoking symmetry arguments. This highlights the need 
for good experimental data. It is important to get accu- 
rate diffraction spectra over a unit /-interval that records 
the intensity of both the Bragg-like peaks and the diffuse 
scattering. We have shown that there exist quantities, 
called the figures-of- merit, that measure the quality of 
the spectral data over a particular unit /-interval and that 
help determine a suitable interval for e-machine spectral 
reconstruction. 

While we have addressed only CPSs here, the exten- 
sion to other layered structures is straightforward. A 
first theoretical task in this is to find an expression for 
the diffracted intensity in terms of suitable CFs and to re- 
late these CFs to the sequence distribution (and thence 
to an e-machine). While such an e-machine may draw 
from an alphabet larger than two for more complicated 
polytypes, such as micas and kaolinSfSi there are in prin- 
ciple no theoretical obstacles to applying eMSR to more 
complicated polytypic structures. 

More generally, eMSR also contributes to the machine- 
learning side of computational mechanics. eMSR is novel, 
in that we use a power spectrum to reconstruct the e- 
machine instead of a temporal data sequence, as prior 
algorithms have. We see this as a prelude to the question 
of how one infers an e-machine from general spectral data 
and are continuing research along these lines. 

There are, however, some limitations to eMSR, as pre- 
sented here. We only attempted e-machine reconstruc- 
tion up to r — 3. While in principle one can attempt it for 
any r, there are computational complexity difficulties. In 
the most general case, the number of variables one needs 
to solve for is exponential in r, and many of the equations 
are nonlinear. More seriously, the maximum number of 



terms in any equation grows as an exponential of an ex- 
ponential in r. For r — 3, there were 11 terms in two of 
the equations. At r = 4, two of the equations have 171 
terms, all of them nonlinear. For r = 5, this grows to 
43690 terms^ These terms are all additive, so a fortu- 
itous cancellation is not possible. It is possible, however, 
that physical insight into the relative importance of se- 
quences may allow one to neglect a number of terms in 
these equations. We feel that the general case of r = 4 is 
tractable, and this is a subject of current research. We 
also suspect that there are alternative algorithms that 
will greatly reduce the computational complexity of find- 
ing solutions. 

Finally, we stress that there is a difference between 
structure and mechanism in disordered stacking se- 
quences. The e-machine describes the structure, but has 
little to say about how the material came to be stacked 
in this fashion. While it is possible to formally identify 
CSCs with "faulting structures" , this can be misleading. 
It is certainly possible that the cumulative effects of re- 
peated faulting by a particular mechanism may lead to 
a structure that is different from a crystal simply per- 
meated with that kind of fault. That is, for high fault 
densities, adjacent faults may be produced in the same 
way, but the close proximity of the faults may cause us to 
interpret the structure differently — e.g., as a small seg- 
ment of complex crystal. 

In order to determine the mechanism of faulting in, say, 
an annealed crystal undergoing a solid-state phase tran- 
sition, it is desirable to begin with many (identical) crys- 
tals and arrest the solid-state transformation at various 
stages. By reconstructing the e-machine after different 
annealing times, the route to disorder can be made plain. 
The result is a picture of how structure (as captured by 
intermediate e-machines) changes during annealing. This 
change in structure should give direct insight into the 
structure-forming mechanisms. This should be compared 
with the numerical simulation of faulting in a crystal, 
such as those done by Kabra and PandeyjiiS Engelji2& 
Shrestha and Pandey ; 70 ' 73 Gosk ) 62 ' 109 and Ramasesha 
and RaoiiS We note that in such simulations, the e- 
machine can be directly calculated from the sequence to 
high accuracy. Some experimental work on solid-state 
phase transitions has been done^ but we hope that there 
will be additional effort in this direction. 
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APPENDIX A: THE SPECTRAL EQUATIONS 

1. r = 1 

The spectral equations at r = 1 are linear and admit 
an analytical solution. Specifically, we write out Eqs. 
0, ®, and for r = 1 and solve them. We find, 



P(ll) 
P(01) 
P(00) 



Qr(2) 

p(10) = 

g c (2) 



-[1-Q c (2)-Q r (2)] 



Similarly, the spectral equations at r = 2 are linear 
and also can be solved analytically. Again, we write out 
Eqs. 0,0, and © for r = 2 and solve them. We find, 



P(000) 
P(001) 
P(010) 
P(011) 
P(100) 
P(101) 
P(110) 

P(lll) 



[3Q C (2) - 
[3Q C (2) + 
[-3Q C (2) 
[3Q C (2) + 
[3Q C (2) + 
[-3Q C (2) 
[3Q C (2) + 
[-3Q C (2) 



2Q C (3) - 
2Q C (3) + 

- 2Q C (3) 
4Q C (3) + 
2Q C (3) + 

- Q c (3) - 
4Q C (3) + 

- 4Q C (3) 



3Q r (2) - 
3Q r (2) + 
- 3Q r (2) 
3Q r (2) + 
3Q r (2) + 
3Q r (2)- 
3Q r (2) + 
+ 3Q r (2) 



4Q r (3) + 3]/6 , 
4Q r (3)-3]/6 , 
-Q r (3)+3]/3 , 
2Q r (3)-3]/6 , 
4Q r (3)-3]/6 , 
2Q r (3) + 3]/3 , 
2Q r (3)-3]/6 , 
-2Q r (3) + 3]/6 



At r = 3, we require 16 relations to constrain the 
length-4 binary-sequence probabilities. Now, however, 
we encounter nonlinearities, and by necessity the spec- 
tral equations are solved numerically. We write them 
out here. 

At r = 3, Eq. implies the following seven equations. 



P(0111) = P(1110) , 

P(0001) = P(1000) , 

P(0011) + P(1011) = P(0111) + P(0110) , 

P(0101) + P(1101) = P(1011) + P(1010) , 

P(0010) + P(1010) = P(0101) + P(0100) , 

P(0001) + P(1001) = P(0011) + P(0010) , 

P(0100) + P(1100) = P(1001) + P(1000) . 



Equation provides for normalization, providing one 
additional constraint. Finally, the remaining 8 relations 
are obtained by relating sequence probabilities to CFs as 
prescribed by Eq. 0. We further reduce the last two 
relations which involve sequence probabilities of length-5 
to those of length-4 via relations of the form given by Eq. 
ljTU)l. We find, 



Qc(2) = 

Q r {2) = 

Q c (3) = 

Qr(3) = 

Qc(4) = 

Q r (4) = 

Q c (5) = 



-(5) 



P(0001) + 
P(1101) + 
P(0111) + 
f P(1101) 
P(0011) + 
f P(1001) 
P(1000) + 



P(0000) + 
P(1100)4 
P(0110)4 
+P(1100) 
P(0010)4 
+P(1000) 
P(llll)4 
+P(0001) 
P(0000)4 
+P(1110) , 

P 2 (0000) 
P(0000) + P(0001) + 

P(0101)P(1011) 
+ P(1011) + P(1010) 

P(0111)P(1110) 
+ P(1110) + P(1111) 

P(1010)P(0101) 
+ P(0101) + P(0100) 

P(1100)P(1001) 
+ P(1001) + P(1000) 

P(1110)P(1100) 
+ P(1100) + P(1101) 

P 2 (llll) 

P(llll) + P(1110) + 

P(1010)P(0100) 
+ P(0100) + P(0101) 

P(1000)P(0001) 
+ P(0001) + P(0000) 

P(0101)P(1010) 
+ P(1010) + P(1011) 

P(0011)P(0110) 
+ P(0110) + P(0111) 

P(0001)P(0011) 
+ P(0011) + P(0010) 



P(0010) + P(0011) 
P(1110) + P(llll) 
P(1010) + P(1011) 

P(0100) + P(0101) 

P(0100) + P(0010) 



P(0111) + P(1011) + P(1101) 



P(0011)P(0111) 
P(0111) + P(0110) 

P(0110)P(1101) 
+ P(1101) + P(1100) 

P(1001)P(0011) 
+ P(0011) + P(0010) 

P(1011)P(0110) 
+ P(0110) + P(0111) 

P(1101)P(1010) 
+ P(1010) + P(1011) 



P(1100)P(1000) 
P(1000) + P(1001) 

P(1001)P(0010) 
+ P(0010) + P(0011) 

P(0110)P(1100) 
+ P(1100) + P(1101) 

P(0100)P(1001) 
+ P(1001) + P(1000) 

P(0010)P(0101) 
+ P(0101) + P(0100) 
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